Deep learning-based cancer survival prognosis from RNA-seq data: approaches and evaluations

Zhi Huang; Travis S Johnson; Zhi Han; Bryan Helm; Sha Cao; Chi Zhang; Paul Salama; Maher Rizkalla; Christina Y Yu; Jun Cheng; Shunian Xiang; Xiaohui Zhan; Jie Zhang; Kun Huang

doi:10.1186/s12920-020-0686-1

Deep learning-based cancer survival prognosis from RNA-seq data: approaches and evaluations

BMC Med Genomics. 2020 Apr 3;13(Suppl 5):41. doi: 10.1186/s12920-020-0686-1.

Authors

Zhi Huang^{1

2

3}, Travis S Johnson^{2

4}, Zhi Han², Bryan Helm², Sha Cao⁵, Chi Zhang^{3

5}, Paul Salama³, Maher Rizkalla³, Christina Y Yu^{2

4}, Jun Cheng^{2

6}, Shunian Xiang^{5

7}, Xiaohui Zhan^{2

7}, Jie Zhang⁵, Kun Huang^{8

9}

Affiliations

¹ School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA.
² Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.
³ Department of Electrical and Computer Engineering, Indiana University - Purdue University Indianapolis, Indianapolis, IN, 46202, USA.
⁴ Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA.
⁵ Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.
⁶ National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, 518060, China.
⁷ Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Shenzhen University, Shenzhen, 518060, China.
⁸ Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, 46202, USA. [email protected].
⁹ Department of Electrical and Computer Engineering, Indiana University - Purdue University Indianapolis, Indianapolis, IN, 46202, USA. [email protected].

Abstract

Background: Recent advances in kernel-based Deep Learning models have introduced a new era in medical research. Originally designed for pattern recognition and image processing, Deep Learning models are now applied to survival prognosis of cancer patients. Specifically, Deep Learning versions of the Cox proportional hazards models are trained with transcriptomic data to predict survival outcomes in cancer patients.

Methods: In this study, a broad analysis was performed on TCGA cancers using a variety of Deep Learning-based models, including Cox-nnet, DeepSurv, and a method proposed by our group named AECOX (AutoEncoder with Cox regression network). Concordance index and p-value of the log-rank test are used to evaluate the model performances.

Results: All models show competitive results across 12 cancer types. The last hidden layers of the Deep Learning approaches are lower dimensional representations of the input data that can be used for feature reduction and visualization. Furthermore, the prognosis performances reveal a negative correlation between model accuracy, overall survival time statistics, and tumor mutation burden (TMB), suggesting an association among overall survival time, TMB, and prognosis prediction accuracy.

Conclusions: Deep Learning based algorithms demonstrate superior performances than traditional machine learning based models. The cancer prognosis results measured in concordance index are indistinguishable across models while are highly variable across cancers. These findings shedding some light into the relationships between patient characteristics and survival learnability on a pan-cancer level.

Keywords: Cancer prognosis; Cox regression; Deep learning; Survival analysis; Tumor mutation burden.

Publication types

Evaluation Study
Research Support, Non-U.S. Gov't

MeSH terms

Adolescent
Adult
Aged
Aged, 80 and over
Algorithms*
Biomarkers, Tumor / genetics*
Computational Biology / methods*
Deep Learning*
Female
Gene Expression Regulation, Neoplastic*
Gene Regulatory Networks
Humans
Male
Middle Aged
Neoplasms / genetics
Neoplasms / mortality*
Neoplasms / pathology
Prognosis
RNA-Seq / methods*
Survival Rate
Transcriptome
Young Adult

Substances

Biomarkers, Tumor

Grants and funding

U01 CA188547/CA/NCI NIH HHS/United States