The importance of prediction model validation and assessment in obesity and nutrition research

Int J Obes (Lond). 2016 Jun;40(6):887-94. doi: 10.1038/ijo.2015.214. Epub 2015 Oct 9.

Abstract

Deriving statistical models to predict one variable from one or more other variables, or predictive modeling, is an important activity in obesity and nutrition research. To determine the quality of the model, it is necessary to quantify and report the predictive validity of the derived models. Conducting validation of the predictive measures provides essential information to the research community about the model. Unfortunately, many articles fail to account for the nearly inevitable reduction in predictive ability that occurs when a model derived on one data set is applied to a new data set. Under some circumstances, the predictive validity can be reduced to nearly zero. In this overview, we explain why reductions in predictive validity occur, define the metrics commonly used to estimate the predictive validity of a model (for example, coefficient of determination (R(2)), mean squared error, sensitivity, specificity, receiver operating characteristic and concordance index) and describe methods to estimate the predictive validity (for example, cross-validation, bootstrap, and adjusted and shrunken R(2)). We emphasize that methods for estimating the expected reduction in predictive ability of a model in new samples are available and this expected reduction should always be reported when new predictive models are introduced.

Publication types

  • Review
  • Research Support, N.I.H., Extramural

MeSH terms

  • Biomedical Research / methods*
  • Biomedical Research / standards*
  • Humans
  • Models, Statistical
  • Nutritional Sciences / standards*
  • Obesity*
  • Predictive Value of Tests
  • Reproducibility of Results