A fusion decision system to identify and grade malnutrition in cancer patients: Machine learning reveals feasible workflow from representative real-world data

Clin Nutr. 2021 Aug;40(8):4958-4970. doi: 10.1016/j.clnu.2021.06.028. Epub 2021 Jul 3.

Abstract

Background and aims: Most nutritional assessment tools are based on pre-defined questionnaires or consensus guidelines. However, it has been postulated that population data can be used directly to develop a solution for assessing malnutrition. This study established a machine learning (ML)-based, individualized decision system to identify and grade malnutrition using large-scale data from cancer patients.

Methods: This was an observational, nationwide, multicenter cohort study that included 14134 cancer patients from five institutions in four different geographic regions of China. Multi-stage K-means clustering was performed to isolate and grade malnutrition based on 17 core nutritional features. The effectiveness of the identified clusters for reflecting clinical characteristics, nutritional status and patient outcomes was comprehensively evaluated. The study population was randomly split for model derivation and validation. Multiple ML algorithms were developed, validated and compared to screen for optimal models to implement the cluster prediction.

Results: A well-nourished cluster (n = 8193, 58.0%) and a malnourished cluster with three phenotype-specific severity levels (mild = 2195, 15.5%; moderate = 2491, 17.6%; severe = 1255, 8.9%) were identified. The clusters showed moderate agreement with the Patient-Generated Subjective Global Assessment and the Global Leadership Initiative on Malnutrition criteria. The severity of malnutrition was negatively associated with the nutritional status, physical status, quality of life, and short-term outcomes, and was monotonically correlated with reduced overall survival. A multinomial logistic regression was found to be the optimal ML algorithm, and models built based on this algorithm showed almost perfect performance to predict the clusters in the validation data.

Conclusions: This study developed a fusion decision system that can be used to facilitate the identification and severity grading of malnutrition in patients with cancer. Moreover, the study workflow is flexible, and might provide a generalizable solution for the artificial intelligence-based assessment of malnutrition in a wider variety of scenarios.

Keywords: Cancer; K-means clustering; Machine learning; Malnutrition; Real-world study.

Publication types

  • Multicenter Study
  • Observational Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • China
  • Cluster Analysis
  • Diagnosis, Computer-Assisted / methods*
  • Feasibility Studies
  • Female
  • Functional Status
  • Humans
  • Logistic Models
  • Machine Learning*
  • Male
  • Malnutrition / diagnosis*
  • Malnutrition / etiology
  • Middle Aged
  • Neoplasms / complications*
  • Neoplasms / physiopathology
  • Nutrition Assessment*
  • Nutritional Status
  • Quality of Life
  • Severity of Illness Index
  • Workflow