How to Read Articles That Use Machine Learning: Users' Guides to the Medical Literature

Yun Liu; Po-Hsuan Cameron Chen; Jonathan Krause; Lily Peng

doi:10.1001/jama.2019.16489

How to Read Articles That Use Machine Learning: Users' Guides to the Medical Literature

JAMA. 2019 Nov 12;322(18):1806-1816. doi: 10.1001/jama.2019.16489.

Authors

Yun Liu¹, Po-Hsuan Cameron Chen¹, Jonathan Krause¹, Lily Peng¹

Affiliation

¹ Google Health, Palo Alto, California.

PMID: 31714992
DOI: 10.1001/jama.2019.16489

Abstract

In recent years, many new clinical diagnostic tools have been developed using complicated machine learning methods. Irrespective of how a diagnostic tool is derived, it must be evaluated using a 3-step process of deriving, validating, and establishing the clinical effectiveness of the tool. Machine learning-based tools should also be assessed for the type of machine learning model used and its appropriateness for the input data type and data set size. Machine learning models also generally have additional prespecified settings called hyperparameters, which must be tuned on a data set independent of the validation set. On the validation set, the outcome against which the model is evaluated is termed the reference standard. The rigor of the reference standard must be assessed, such as against a universally accepted gold standard or expert grading.

MeSH terms

Algorithms
Humans
Machine Learning*
Models, Theoretical*
Publications
Sensitivity and Specificity