Development and external validation of a machine learning model for prediction of survival in undifferentiated pleomorphic sarcoma

Musculoskelet Surg. 2024 Mar;108(1):77-86. doi: 10.1007/s12306-023-00795-w. Epub 2023 Sep 1.

Abstract

Purpose: Machine learning (ML) algorithms to predict cancer survival have recently been reported for a number of sarcoma subtypes, but none have investigated undifferentiated pleomorphic sarcoma (UPS). ML is a powerful tool that has the potential to better prognosticate UPS.

Methods: The Surveillance, Epidemiology, and End Results (SEER) database was queried for cases of histologically confirmed UPS (n = 665). Patient, tumor, and treatment characteristics were recorded, and ML models were developed to predict 1-, 3-, and 5-year survival. The best-performing ML model was externally validated using an institutional cohort of UPS patients (n = 151).
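The abstract does not state how survival at each time point was encoded. One common approach, sketched here only under that assumption, is to convert follow-up time and vital status into a binary label per horizon and to set aside patients censored before the horizon, since their status at that point is unknown. The function name, the record layout, and the sample values are hypothetical and are not taken from the study.

```python
from typing import Optional


def survival_label(months_followed: float, died: bool, horizon_years: int) -> Optional[int]:
    """Binary survival label at a fixed horizon (illustrative assumption, not the study's method).

    Returns 1 if the patient was followed to or past the horizon (alive at the horizon),
    0 if the patient died before the horizon, and None if the patient was
    censored before the horizon (status at the horizon is unknown).
    """
    horizon_months = 12 * horizon_years
    if months_followed >= horizon_months:
        return 1  # still under follow-up at the horizon, so alive at that point
    if died:
        return 0  # death observed before the horizon
    return None  # censored early: exclude from this horizon's training set


# Hypothetical records: (months of follow-up, death observed)
patients = [(70.0, False), (8.0, True), (30.0, False), (40.0, True)]

# One label list per prediction horizon (1, 3, and 5 years)
labels = {h: [survival_label(m, d, h) for m, d in patients] for h in (1, 3, 5)}
```

In this sketch the third patient contributes to the 1-year model but is dropped from the 3- and 5-year models, which is one reason later horizons can have fewer usable cases.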

Results: All ML models performed best at the 1-year time point and worst at the 5-year time point. On internal validation within the SEER cohort, the best models had c-statistics of 0.67-0.69 at the 5-year time point. The Multi-Layer Perceptron Neural Network (MLP) model was the best-performing model and was used for external validation. On external validation, the MLP model likewise performed best at 1 year and worst at 5 years, with c-statistics of 0.85 and 0.81, respectively. The MLP model was well calibrated on external validation. The MLP model has been made publicly available at https://rachar.shinyapps.io/ups_app/.
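For readers unfamiliar with the c-statistic, the following is a minimal sketch of how it can be computed for a binary outcome at a single time point. It assumes the binary form of the metric (equivalent to the area under the ROC curve); the abstract does not say whether this form or a time-to-event concordance index was used. The function name and the sample values are hypothetical.

```python
def c_statistic(y_true, y_prob):
    """Concordance for a binary outcome (illustrative sketch).

    Fraction of (survivor, non-survivor) pairs in which the survivor received
    the higher predicted survival probability; tied predictions count as half.
    """
    survivors = [p for y, p in zip(y_true, y_prob) if y == 1]
    non_survivors = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not survivors or not non_survivors:
        raise ValueError("need at least one patient in each outcome class")

    concordant = 0.0
    for s in survivors:
        for n in non_survivors:
            if s > n:
                concordant += 1.0  # pair ranked correctly
            elif s == n:
                concordant += 0.5  # tie counts as half
    return concordant / (len(survivors) * len(non_survivors))


# Hypothetical outcomes (1 = alive at the horizon) and predicted survival probabilities
y_true = [1, 1, 0, 1, 0]
y_prob = [0.9, 0.6, 0.7, 0.8, 0.2]
c = c_statistic(y_true, y_prob)  # 5 of 6 pairs are ranked correctly
```

A value of 0.5 corresponds to ranking no better than chance and 1.0 to perfect ranking, which gives context for the reported range of 0.67 to 0.85.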

Conclusion: Machine learning models perform well for survival prediction in UPS, though this sarcoma subtype may be more difficult to prognosticate than other subtypes. Future studies are needed to further validate the machine learning approach for UPS prognostication.

Keywords: Machine learning; Prognostication; Soft tissue sarcoma; Survival.

MeSH terms

  • Algorithms
  • Humans
  • Machine Learning
  • Sarcoma* / therapy
  • Soft Tissue Neoplasms* / pathology