The importance of prediction model validation and assessment in obesity and nutrition research

A E Ivanescu; P Li; B George; A W Brown; S W Keith; D Raju; D B Allison

doi:10.1038/ijo.2015.214

The importance of prediction model validation and assessment in obesity and nutrition research

Int J Obes (Lond). 2016 Jun;40(6):887-94. doi: 10.1038/ijo.2015.214. Epub 2015 Oct 9.

Authors

A E Ivanescu¹, P Li², B George², A W Brown², S W Keith³, D Raju⁴, D B Allison^{2

5}

Affiliations

¹ Department of Mathematical Sciences, Montclair State University, Montclair, NJ, USA.
² Office of Energetics and Nutrition Obesity Research Center, University of Alabama at Birmingham, Birmingham, AL, USA.
³ Division of Biostatistics, Department of Pharmacology and Experimental Therapeutics, Thomas Jefferson University, Philadelphia, PA, USA.
⁴ School of Nursing, University of Alabama at Birmingham, Birmingham, AL, USA.
⁵ Department of Biostatistics, University of Alabama at Birmingham, Birmingham, AL, USA.

Abstract

Deriving statistical models to predict one variable from one or more other variables, or predictive modeling, is an important activity in obesity and nutrition research. To determine the quality of the model, it is necessary to quantify and report the predictive validity of the derived models. Conducting validation of the predictive measures provides essential information to the research community about the model. Unfortunately, many articles fail to account for the nearly inevitable reduction in predictive ability that occurs when a model derived on one data set is applied to a new data set. Under some circumstances, the predictive validity can be reduced to nearly zero. In this overview, we explain why reductions in predictive validity occur, define the metrics commonly used to estimate the predictive validity of a model (for example, coefficient of determination (R(2)), mean squared error, sensitivity, specificity, receiver operating characteristic and concordance index) and describe methods to estimate the predictive validity (for example, cross-validation, bootstrap, and adjusted and shrunken R(2)). We emphasize that methods for estimating the expected reduction in predictive ability of a model in new samples are available and this expected reduction should always be reported when new predictive models are introduced.

Publication types

Review
Research Support, N.I.H., Extramural

MeSH terms

Biomedical Research / methods*
Biomedical Research / standards*
Humans
Models, Statistical
Nutritional Sciences / standards*
Obesity*
Predictive Value of Tests
Reproducibility of Results

Abstract

Publication types

MeSH terms

Grants and funding