Prediction of cyanotic and acyanotic congenital heart disease using machine learning models

World J Clin Pediatr. 2024 Dec 9;13(4):98472. doi: 10.5409/wjcp.v13.i4.98472.

Abstract

Background: Congenital heart disease is most commonly seen in neonates and it is a major cause of pediatric illness and childhood morbidity and mortality.

Aim: To identify and build the best predictive model for predicting cyanotic and acyanotic congenital heart disease in children during pregnancy and identify their potential risk factors.

Methods: The data were collected from the Pediatric Cardiology Department at Chaudhry Pervaiz Elahi Institute of Cardiology Multan, Pakistan from December 2017 to October 2019. A sample of 3900 mothers whose children were diagnosed with cyanotic or acyanotic congenital heart disease was taken. Multivariate outlier detection methods were used to identify the potential outliers. Different machine learning models were compared, and the best-fitted model was selected using the area under the curve, sensitivity, and specificity of the models.

Results: Out of 3900 patients included, about 69.5% had acyanotic and 30.5% had cyanotic congenital heart disease. Males had more cases of acyanotic (53.6%) and cyanotic (54.5%) congenital heart disease as compared to females. The odds of having cyanotic was 1.28 times higher for children whose mothers used more fast food frequently during pregnancy. The artificial neural network model was selected as the best predictive model with an area under the curve of 0.9012, sensitivity of 65.76%, and specificity of 97.23%.

Conclusion: Children having a positive family history are at very high risk of having cyanotic and acyanotic congenital heart disease. Males are more at risk and their mothers need more care, good food, and physical activity during pregnancy. The best-fitted model for predicting cyanotic and acyanotic congenital heart disease is the artificial neural network. The results obtained and the best model identified will be useful for medical practitioners and public health scientists for an informed decision-making process about the earlier diagnosis and improve the health condition of children in Pakistan.

Keywords: Acyanotic heart disease; Artificial neural network; Congenital heart disease; Cyanotic heart disease; Logistic regression model.