Evaluating prediction models in reproductive medicine

S F P J Coppus; F van der Veen; B C Opmeer; B W J Mol; P M M Bossuyt

doi:10.1093/humrep/dep109

Hum Reprod. 2009 Aug;24(8):1774-8. doi: 10.1093/humrep/dep109. Epub 2009 Apr 23.

Authors

S F P J Coppus¹, F van der Veen, B C Opmeer, B W J Mol, P M M Bossuyt

Affiliation

¹ Department of Obstetrics and Gynaecology, Centre for Reproductive Medicine, Academic Medical Centre, Amsterdam, The Netherlands.

PMID: 19395365

Abstract

Prediction models are used in reproductive medicine to calculate the probability of pregnancy without treatment, as well as the probability of pregnancy after ovulation induction, intrauterine insemination or in vitro fertilization. The performance of such prediction models is often evaluated with a receiver operating characteristic (ROC) curve, and the area under that curve, also known as the c-statistic, is then used as a measure of model performance. The value of the c-statistic is low for most prediction models in reproductive medicine. Here, we demonstrate that low values of the c-statistic are to be expected in these prediction models, but we also show that this does not imply that the models are of limited use in clinical practice. More meaningful concepts for model evaluation are the calibration of the model (the correspondence between model-based probabilities and observed pregnancy rates), the availability of a clinically useful distribution of probabilities, and the ability to correctly identify the appropriate form of management.
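The contrast between the c-statistic and calibration can be illustrated with a minimal Python sketch. It is not taken from the article: the cohort is entirely hypothetical (three groups of 100 couples with predicted pregnancy probabilities of 0.2, 0.3 and 0.4, constructed so that observed pregnancy rates match the predictions exactly), and the helper names `c_statistic` and `calibration_table` are assumptions made for illustration. The c-statistic is computed as the proportion of (pregnant, non-pregnant) pairs in which the pregnant couple received the higher predicted probability, with ties counted as one half.

```python
from collections import defaultdict


def c_statistic(probs, outcomes):
    """Concordance over all (pregnant, non-pregnant) pairs; ties count 0.5."""
    pos = [p for p, y in zip(probs, outcomes) if y == 1]
    neg = [p for p, y in zip(probs, outcomes) if y == 0]
    score = 0.0
    for p_pos in pos:
        for p_neg in neg:
            if p_pos > p_neg:
                score += 1.0
            elif p_pos == p_neg:
                score += 0.5
    return score / (len(pos) * len(neg))


def calibration_table(probs, outcomes):
    """Observed pregnancy rate for each distinct predicted probability."""
    groups = defaultdict(list)
    for p, y in zip(probs, outcomes):
        groups[p].append(y)
    return {p: sum(ys) / len(ys) for p, ys in sorted(groups.items())}


# Hypothetical cohort (an assumption, not data from the article):
# 100 couples per group, with pregnancies matching the predicted probability.
probs, outcomes = [], []
for p, n_pregnant in [(0.2, 20), (0.3, 30), (0.4, 40)]:
    probs += [p] * 100
    outcomes += [1] * n_pregnant + [0] * (100 - n_pregnant)

c = c_statistic(probs, outcomes)
table = calibration_table(probs, outcomes)

print(round(c, 3))  # 0.606
print(table)        # observed rate per predicted probability
```

In this constructed example the observed rates equal the predicted probabilities in every group, yet the c-statistic is only about 0.61, because predicted probabilities confined to a narrow range cannot separate couples who conceive from those who do not.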

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Calibration
Female
Humans
Male
Models, Statistical*
Pregnancy
Pregnancy Outcome*
Probability
Prognosis
ROC Curve
Reproductive Medicine*