Machine learning analysis for the association between breast feeding and metabolic syndrome in women

Sci Rep. 2024 Feb 20;14(1):4138. doi: 10.1038/s41598-024-53137-6.

Abstract

This cross-sectional study aimed to develop and validate population-based machine learning models for examining the association between breastfeeding and metabolic syndrome in women. The artificial neural network, the decision tree, logistic regression, the Naïve Bayes, the random forest and the support vector machine were developed and validated to predict metabolic syndrome in women. Data came from 30,204 women, who aged 20 years or more and participated in the Korean National Health and Nutrition Examination Surveys 2010-2019. The dependent variable was metabolic syndrome. The 86 independent variables included demographic/socioeconomic determinants, cardiovascular disease, breastfeeding duration and other medical/obstetric information. The random forest had the best performance in terms of the area under the receiver-operating-characteristic curve, e.g., 90.7%. According to random forest variable importance, the top predictors of metabolic syndrome included body mass index (0.1032), medication for hypertension (0.0552), hypertension (0.0499), cardiovascular disease (0.0453), age (0.0437) and breastfeeding duration (0.0191). Breastfeeding duration is a major predictor of metabolic syndrome for women together with body mass index, diagnosis and medication for hypertension, cardiovascular disease and age.

MeSH terms

  • Bayes Theorem
  • Breast Feeding
  • Cardiovascular Diseases*
  • Cross-Sectional Studies
  • Female
  • Humans
  • Hypertension*
  • Machine Learning
  • Metabolic Syndrome* / epidemiology