Predictive modeling has become a distinct subdiscipline of reproductive medicine, and researchers and clinicians are just learning the skills and expertise to evaluate artificial intelligence (AI) studies. Diagnostic tests and model predictions are subject to evaluation. Their use offers potential for both harm and benefit in terms of diagnosis, treatment, and prognosis. The performance of AI models and their potential clinical utility hinge on the quality and size of the databases used, the types and distribution of data, and the particular AI method applied. Additionally, when images are involved, the method of capturing, preprocessing, and treatment and accurate labeling of images becomes an important component of AI modeling. Inconsistent image treatment or inaccurate labeling of images can lead to an inconsistent database, resulting in poor AI accuracy. We discuss the critical appraisal of AI models in reproductive medicine and convey the importance of transparency and standardization in reporting AI models so that the risk of bias and the potential clinical utility of AI can be assessed.
Keywords: Artificial intelligence; artificial neural networks; convolutional neural networks; deep learning; machine learning.
Copyright © 2020 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.