Association between serum hypertriglyceridemia and hematological indices: data mining approaches

BMC Med Inform Decis Mak. 2024 Dec 28;24(1):410. doi: 10.1186/s12911-024-02835-2.

Abstract

Background: High triglyceride (TG) affects and is affected of other hematological factors. The determination of serum fasted triglycerides concentrations, as part of a lipid profile, is crucial key point in hematological factors and significantly affect various systemic diseases. This study was carried out to assess the potential relation between the concentration of TG and hematological factors.

Method: Our sample size was 9704 participants beginning in 2007 and ending in 2020 aged between 35 and 65 years, sourced from the MASHAD cohort (northeastern Iran). Machine learning methodologies, specifically logistic regression, decision tree, and random forest algorithms, were utilized for data analysis in the investigation of individuals with normal and high TG levels.

Results: The highest Gini score belongs to RLR (Red cell distribution width/Lymphocyte) (236.10), RPR (Red cell distribution width/Platelets) (215.78), and PHR (Platelets/high-density lipoprotein) (273.66). We also found that factors such as age are statistically associated with the level of TG in women probably due to the drop in menopausal estrogen. RF model showed to have higher accuracy in predicting the TG level in both males and females.

Conclusion: Our model assessed the association between serum TG with several hematological factors like RLR, RPR, and PHR. Other hematological factors also have been reported to be related to the TG level. As these results give us new insights into the association of TG on various hematological factors and their possible interactions with each other. future studies are needed to provide sufficient data for the mechanism and the pathophysiology of the findings.

Keywords: Decision Tree; Hematological factors; Hypertriglyceridemia; Machine learning; Random Forest.

MeSH terms

  • Adult
  • Aged
  • Data Mining*
  • Erythrocyte Indices
  • Female
  • Humans
  • Hypertriglyceridemia* / blood
  • Iran
  • Machine Learning
  • Male
  • Middle Aged
  • Triglycerides / blood

Substances

  • Triglycerides