Objective: To evaluate a machine learning (ML)-based model for pulmonary hypertension (PH) prediction using measurements and impressions made during echocardiography.
Methods: A total of 7853 consecutive patients with right-sided heart catheterization and transthoracic echocardiography performed within 1 week from January 1, 2012, through December 31, 2019, were included. The data were split into training (n=5024 [64%]), validation (n=1275 [16%]), and testing (n=1554 [20%]). A gradient boosting machine with enumerated grid search for optimization was selected to allow missing data in the boosted trees without imputation. The training target was PH, defined by right-sided heart catheterization as mean pulmonary artery pressure above 20 mm Hg; model performance was maximized relative to area under the receiver operating characteristic curve using 5-fold cross-validation.
Results: Cohort age was 64±14 years; 3467 (44%) were female, and 81% (6323/7853) had PH. The final trained model included 19 characteristics, measurements, or impressions derived from the echocardiogram. In the testing data, the model had high discrimination for the detection of PH (area under the receiver operating characteristic curve, 0.83; 95% CI, 0.80 to 0.85). The model's accuracy, sensitivity, positive predictive value, and negative predictive value were 82% (1267/1554), 88% (1098/1242), 89% (1098/1241), and 54% (169/313), respectively.
Conclusion: By use of ML, PH could be predicted on the basis of clinical and echocardiographic variables, without tricuspid regurgitation velocity. Machine learning methods appear promising for identifying patients with low likelihood of PH.
Copyright © 2023 Mayo Foundation for Medical Education and Research. Published by Elsevier Inc. All rights reserved.