Machine Learning to Predict the Individual Risk of Treatment-Relevant Toxicity for Patients With Breast Cancer Undergoing Neoadjuvant Systemic Treatment

Lie Cai; Thomas M Deutsch; Chris Sidey-Gibbons; Michelle Kobel; Fabian Riedel; Katharina Smetanay; Carlo Fremd; Laura Michel; Michael Golatta; Joerg Heil; Andreas Schneeweiss; André Pfob

doi:10.1200/CCI.24.00010

Machine Learning to Predict the Individual Risk of Treatment-Relevant Toxicity for Patients With Breast Cancer Undergoing Neoadjuvant Systemic Treatment

JCO Clin Cancer Inform. 2024 Dec:8:e2400010. doi: 10.1200/CCI.24.00010. Epub 2024 Dec 23.

Authors

Lie Cai¹, Thomas M Deutsch¹, Chris Sidey-Gibbons^{2

3}, Michelle Kobel¹, Fabian Riedel¹, Katharina Smetanay^{1

4}, Carlo Fremd^{1

4}, Laura Michel^{1

4}, Michael Golatta^{1

5}, Joerg Heil^{1

5}, Andreas Schneeweiss⁴, André Pfob^{1

2

4}

Affiliations

¹ Department of Obstetrics and Gynecology, Heidelberg University Hospital, Heidelberg, Germany.
² MD Anderson Center for INSPiRED Cancer Care (Integrated Systems for Patient-Reported Data), The University of Texas MD Anderson Cancer Center, Houston, TX.
³ Department of Symptom Research, The University of Texas MD Anderson Cancer Center, Houston, TX.
⁴ National Center for Tumor Diseases, Heidelberg University Hospital and German Cancer Research Center, Heidelberg, Germany.
⁵ Breast Centre Heidelberg, Klinik St Elisabeth, Heidelberg, Germany.

PMID: 39715466
PMCID: PMC11670908 (available on 2025-12-23)
DOI: 10.1200/CCI.24.00010

Abstract

Purpose: Toxicity to systemic cancer treatment represents a major anxiety for patients and a challenge to treatment plans. We aimed to develop machine learning algorithms for the upfront prediction of an individual's risk of experiencing treatment-relevant toxicity during the course of treatment.

Methods: Clinical records were retrieved from a single-center, consecutive cohort of patients who underwent neoadjuvant treatment for early breast cancer. We developed and validated machine learning algorithms to predict grade 3 or 4 toxicity (anemia, neutropenia, deviation of liver enzymes, nephrotoxicity, thrombopenia, electrolyte disturbance, or neuropathy). We used 10-fold cross-validation to develop two algorithms (logistic regression with elastic net penalty [GLM] and support vector machines [SVMs]). Algorithm predictions were compared with documented toxicity events and diagnostic performance was evaluated via area under the curve (AUROC).

Results: A total of 590 patients were identified, 432 in the development set and 158 in the validation set. The median age was 51 years, and 55.8% (329 of 590) experienced grade 3 or 4 toxicity. The performance improved significantly when adding referenced treatment information (referenced regimen, referenced summation dose intensity product) in addition to patient and tumor variables: GLM AUROC 0.59 versus 0.75, P = .02; SVM AUROC 0.64 versus 0.75, P = .01.

Conclusion: The individual risk of treatment-relevant toxicity can be predicted using machine learning algorithms. We demonstrate a promising way to improve efficacy and facilitate proactive toxicity management of systemic cancer treatment.

MeSH terms

Adult
Aged
Algorithms
Breast Neoplasms* / drug therapy
Drug-Related Side Effects and Adverse Reactions / diagnosis
Drug-Related Side Effects and Adverse Reactions / etiology
Female
Humans
Machine Learning*
Middle Aged
Neoadjuvant Therapy* / adverse effects
Prognosis
ROC Curve
Support Vector Machine