Validation of a machine learning algorithm for identifying infants at risk of hypoxic ischaemic encephalopathy in a large unseen data set

Arch Dis Child Fetal Neonatal Ed. 2024 Sep 24:fetalneonatal-2024-327366. doi: 10.1136/archdischild-2024-327366. Online ahead of print.

Abstract

Objective: To validate a hypoxic ischaemic encephalopathy (HIE) prediction algorithm to identify infants at risk of HIE immediately after birth using readily available clinical data.

Design: Secondary review of electronic health record data of term deliveries from January 2017 to December 2021.

Setting: A tertiary maternity hospital.

Patients: Infants >36 weeks' gestation with the following clinical variables available: Apgar Score at 1 min and 5 min, postnatal pH, base deficit, and lactate values taken within 1 hour of birth INTERVENTIONS: Previously trained open-source logistic regression and random forest (RF) prediction algorithms were used to calculate a probability index (PI) for each infant for the occurrence of HIE.

Main outcome: Validation of a machine learning algorithm to identify infants at risk of HIE in the immediate postnatal period.

Results: 1081 had a complete data set available within 1 hour of birth: 76 (6.95%) with HIE and 1005 non-HIE. Of the 76 infants with HIE, 37 were classified as mild, 29 moderate and 10 severe. The best overall accuracy was seen with the RF model. Median (IQR) PI in the HIE group was 0.70 (0.53-0.86) vs 0.05 (0.02-0.15), (p<0.001) in the non-HIE group. The area under the receiver operating characteristics curve for prediction of HIE=0.926 (0.893-0.959, p<0.001). Using a PI cut-off to optimise sensitivity of 0.30, 936 of the 1081 (86.5%) infants were correctly classified.

Conclusion: In a large unseen data set an open-source algorithm could identify infants at risk of HIE in the immediate postnatal period. This may aid focused clinical examination, transfer to tertiary care (if necessary) and timely intervention.

Keywords: Intensive Care Units, Neonatal; Neonatology; Neurology.