Comparing machine learning techniques for neonatal mortality prediction: insights from a modeling competition

Brynne A Sullivan; Alvaro G Moreira; Ryan M McAdams; Lindsey A Knake; Ameena Husain; Jiaxing Qiu; Avinash Mudireddy; Abrar Majeedi; Wissam Shalish; Douglas E Lake; Zachary A Vesoulis

doi:10.1038/s41390-024-03773-5

Comparing machine learning techniques for neonatal mortality prediction: insights from a modeling competition

Pediatr Res. 2024 Dec 16. doi: 10.1038/s41390-024-03773-5. Online ahead of print.

Authors

Brynne A Sullivan^#¹, Alvaro G Moreira^#², Ryan M McAdams^#^{3

4}, Lindsey A Knake^#⁵, Ameena Husain^#⁶, Jiaxing Qiu⁷, Avinash Mudireddy⁸, Abrar Majeedi³, Wissam Shalish⁹, Douglas E Lake⁷, Zachary A Vesoulis¹⁰

Affiliations

¹ University of Virginia, Department of Pediatrics, Division of Neonatology, Charlottesville, VA, USA. [email protected].
² University of Texas Health San Antonio, Department of Pediatrics, Division of Neonatology, San Antonio, TX, USA.
³ University of Wisconsin-Madison, Department of Biostatistics and Medical Informatics, Madison, WI, USA.
⁴ University of Wisconsin-Madison, Department of Pediatrics, Division of Neonatology, Madison, WI, USA.
⁵ University of Iowa, Department of Pediatrics, Division of Neonatology, Iowa City, IA, USA.
⁶ University of Utah, Department of Pediatrics, Division of Neonatology, Salt Lake City, UT, USA.
⁷ University of Virginia, Department of Pediatrics, Division of Neonatology, Charlottesville, VA, USA.
⁸ University of Iowa, Iowa Initiative for Artificial Intelligence, Iowa City, IA, USA.
⁹ Research Institute of the McGill University Health Center, Montreal Children's Hospital, Department of Pediatrics, Division of Neonatology, Montreal, Canada.
¹⁰ Washington University in St. Louis, Department of Pediatrics, Division of Newborn Medicine, St. Louis, MO, USA.

^# Contributed equally.

PMID: 39681666
DOI: 10.1038/s41390-024-03773-5

Abstract

Background: Predicting mortality risk in neonatal intensive care units (NICUs) is challenging due to complex, variable clinical and physiological data. Machine learning (ML) offers potential for more accurate risk stratification.

Objective: To compare the performance of various ML models in predicting NICU mortality using a team-based modeling competition.

Methods: We conducted a modeling competition with five neonatologist-led teams applying ML techniques-logistic regression, CatBoost, neural networks, random forest, and XGBoost-to a shared dataset from over 6,000 NICU admissions. The dataset included static demographic and clinical variables, alongside daily samples of heart rate and oxygen saturation. Each team developed models to predict mortality risk at baseline and within 7 days. Models were evaluated using the area under the receiver operator characteristic curve (AUC). Results were presented at a national meeting, where an audience poll ranked models before AUC results were revealed.

Results: The audience favored the most complex model (CNN) for real-world application, though logistic regression achieved the highest AUC on test data. Teams employed varied feature selection, tuning, and evaluation strategies.

Conclusions: Logistic regression outperformed more complex models, highlighting the importance of selecting modeling methods based on data characteristics, interpretability, and expertise rather than model complexity alone.

Impact: By demonstrating that model complexity does not necessarily equate to better predictive performance, this research encourages the careful selection of modeling approaches.