Machine learning based analysis of single-cell data reveals evidence of subject-specific single-cell gene expression profiles in acute myeloid leukaemia patients and healthy controls

Biochim Biophys Acta Gene Regul Mech. 2024 Dec;1867(4):195062. doi: 10.1016/j.bbagrm.2024.195062. Epub 2024 Oct 2.

Abstract

Acute Myeloid Leukaemia (AML) is characterized by uncontrolled growth of immature myeloid cells, disrupting normal blood production. Treatment typically involves chemotherapy, targeted therapy, and stem cell transplantation but many patients develop chemoresistance, leading to poor outcomes due to the disease's high heterogeneity. In this study, we used publicly available single-cell RNA sequencing data and machine learning to classify AML patients and healthy, monocytes, dendritic and progenitor cells population. We found that gene expression profiles of AML patients and healthy controls can be classified at the individual level with high accuracy (>70 %) when using progenitor cells, suggesting the existence of subject-specific single cell transcriptomics profiles. The analysis also revealed molecular determinants of patient heterogeneity (e.g. TPSD1, CT45A1, and GABRA4) which could support new strategies for patient stratification and personalized treatment in leukaemia.

Keywords: Classification; Gene regulation; Genomics; Personalized medicine.

MeSH terms

  • Case-Control Studies
  • Gene Expression Profiling
  • Gene Expression Regulation, Leukemic
  • Humans
  • Leukemia, Myeloid, Acute* / genetics
  • Machine Learning*
  • Single-Cell Analysis* / methods
  • Transcriptome*