Survival After Radical Cystectomy for Bladder Cancer: Development of a Fair Machine Learning Model

Samuel Carbunaru; Yassamin Neshatvar; Hyungrok Do; Katie Murray; Rajesh Ranganath; Madhur Nayan

doi:10.2196/63289

JMIR Med Inform. 2024 Dec 13:12:e63289. doi: 10.2196/63289.

Authors

Samuel Carbunaru¹, Yassamin Neshatvar¹, Hyungrok Do², Katie Murray¹,³, Rajesh Ranganath⁴,⁵, Madhur Nayan¹,²,³

Affiliations

¹ Department of Urology, New York University School of Medicine, New York, NY, United States.
² Department of Population Health, New York University School of Medicine, New York, NY, United States.
³ Department of Urology, Bellevue Hospital, New York City Health and Hospitals, New York, NY, United States.
⁴ Center for Data Science, New York University, New York, NY, United States.
⁵ Courant Institute of Mathematical Sciences, New York University, New York, NY, United States.

PMID: 39671594
DOI: 10.2196/63289

Abstract

Background: Prediction models based on machine learning (ML) methods are increasingly being developed and adopted in health care. However, these models may be prone to bias and are considered unfair if their performance varies across population subgroups. An unfair model is of particular concern in bladder cancer, where disparities have been identified across sex and racial subgroups.

Objective: This study aims (1) to develop an ML model to predict survival after radical cystectomy for bladder cancer and evaluate it for potential bias in sex and racial subgroups; and (2) to compare algorithm unfairness mitigation techniques to improve model fairness.

Methods: We trained and compared various ML classification algorithms to predict 5-year survival after radical cystectomy using the National Cancer Database. The primary model performance metric was the F₁-score. The primary metric for model fairness was the equalized odds ratio (eOR). We compared 3 algorithm unfairness mitigation techniques to improve eOR.
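The two metrics named in the Methods can be illustrated with a minimal sketch. It assumes binary labels with 1 as the positive class, and it assumes the eOR is defined as in the Fairlearn library: the smaller of the between-group ratios (lowest group value divided by highest) of the true-positive rate and the false-positive rate, so that 1.0 means identical error rates across groups. The abstract does not state the definition or which outcome was coded as positive, and the data and function names below are made up for illustration only.

```python
from collections import defaultdict


def f1_score(y_true, y_pred):
    """F1 = 2*TP / (2*TP + FP + FN), with label 1 as the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def group_rates(y_true, y_pred, groups):
    """Return {group: (TPR, FPR)}; each group must contain both classes."""
    counts = defaultdict(lambda: {"tp": 0, "fn": 0, "fp": 0, "tn": 0})
    for t, p, g in zip(y_true, y_pred, groups):
        if t == 1:
            counts[g]["tp" if p == 1 else "fn"] += 1
        else:
            counts[g]["fp" if p == 1 else "tn"] += 1
    rates = {}
    for g, c in counts.items():
        positives = c["tp"] + c["fn"]
        negatives = c["fp"] + c["tn"]
        if positives == 0 or negatives == 0:
            raise ValueError(f"group {g!r} lacks positive or negative cases")
        rates[g] = (c["tp"] / positives, c["fp"] / negatives)
    return rates


def _min_over_max(values):
    """Ratio of the smallest to the largest value; 1.0 if all are zero."""
    highest = max(values)
    return 1.0 if highest == 0 else min(values) / highest


def equalized_odds_ratio(y_true, y_pred, groups):
    """Smaller of the between-group TPR ratio and FPR ratio (assumed definition)."""
    rates = group_rates(y_true, y_pred, groups)
    tpr_ratio = _min_over_max([tpr for tpr, _ in rates.values()])
    fpr_ratio = _min_over_max([fpr for _, fpr in rates.values()])
    return min(tpr_ratio, fpr_ratio)


# Made-up toy data: two hypothetical subgroups "A" and "B".
y_true = [1, 1, 1, 1, 0, 0] + [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0] + [1, 1, 0, 0, 1, 0, 0, 0]
groups = ["A"] * 6 + ["B"] * 8

overall_f1 = f1_score(y_true, y_pred)
eor = equalized_odds_ratio(y_true, y_pred, groups)
print(f"F1 = {overall_f1:.3f}, eOR = {eor:.3f}")
```

In this toy data, group A has a true-positive rate of 0.75 and a false-positive rate of 0.5, while group B has 0.5 and 0.25. The false-positive rate ratio (0.5) is the smaller of the two ratios and therefore sets the eOR.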

Results: We identified 16,481 patients; 23.1% (n=3800) were female, and 91.5% (n=15,080) were "White," 5% (n=832) were "Black," 2.3% (n=373) were "Hispanic," and 1.2% (n=196) were "Asian." The 5-year mortality rate was 75% (n=12,290). The best naive model was extreme gradient boosting (XGBoost), which had an F₁-score of 0.860 and eOR of 0.619. All unfairness mitigation techniques increased the eOR, with correlation remover showing the highest increase and resulting in a final eOR of 0.750. This mitigated model had F₁-scores of 0.86, 0.904, and 0.824 in the full, Black male, and Asian female test sets, respectively.
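The idea behind a correlation remover can be sketched as follows. This is a simplified illustration, not the study's implementation: it assumes a single numeric sensitive attribute and removes, from each feature column, the part that is linearly explained by that attribute (a least-squares projection on the mean-centered attribute). The `alpha` parameter, the function names, and the toy data are assumptions introduced here. A preprocessing technique by this name is available in the Fairlearn library, but the abstract does not say which implementation or settings were used.

```python
def covariance(xs, ys):
    """Population covariance of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n


def remove_linear_correlation(X, s, alpha=1.0):
    """Subtract from each column of X its linear projection on centered s.

    X     : list of rows (lists of floats), non-sensitive features
    s     : list of floats, one sensitive attribute value per row
    alpha : 1.0 removes the linear correlation fully, 0.0 leaves X unchanged
    """
    n = len(s)
    s_mean = sum(s) / n
    s_centered = [v - s_mean for v in s]
    sum_sq = sum(v * v for v in s_centered)
    if sum_sq == 0:
        # Sensitive attribute is constant, so there is nothing to remove.
        return [row[:] for row in X]
    n_features = len(X[0])
    # Least-squares slope of each feature column on the centered attribute.
    betas = [
        sum(s_centered[i] * X[i][j] for i in range(n)) / sum_sq
        for j in range(n_features)
    ]
    return [
        [X[i][j] - alpha * betas[j] * s_centered[i] for j in range(n_features)]
        for i in range(n)
    ]


# Made-up toy data: 4 rows, 2 features, binary sensitive attribute.
s = [0.0, 0.0, 1.0, 1.0]
X = [[1.0, 5.0], [2.0, 3.0], [3.0, 4.0], [4.0, 6.0]]

X_filtered = remove_linear_correlation(X, s)
for j in range(2):
    before = covariance([row[j] for row in X], s)
    after = covariance([row[j] for row in X_filtered], s)
    print(f"feature {j}: covariance with s before={before:.3f}, after={after:.3f}")
```

After the transformation, each feature column has zero covariance with the sensitive attribute, so a downstream model cannot exploit a linear proxy for it. Nonlinear dependence is not removed by this approach.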

Conclusions: The ML model predicting survival after radical cystectomy exhibited bias across sex and racial subgroups. By using algorithm unfairness mitigation techniques, we improved algorithmic fairness as measured by the eOR. Our study highlights the importance of not only evaluating for model bias but also actively mitigating it to ensure equitable health care delivery. We also deployed the first web-based fair ML model for predicting survival after radical cystectomy.

Keywords: algorithmic fairness; bias; bladder cancer; fairness; health equity; healthcare disparities; machine learning; model; mortality rate; prediction; radical cystectomy; survival.

©Samuel Carbunaru, Yassamin Neshatvar, Hyungrok Do, Katie Murray, Rajesh Ranganath, Madhur Nayan. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 13.12.2024.

MeSH terms

Aged
Algorithms
Cystectomy* / methods
Female
Humans
Machine Learning*
Male
Middle Aged
Urinary Bladder Neoplasms* / mortality
Urinary Bladder Neoplasms* / surgery