Machine learning identifies a core gene set predictive of acquired resistance to EGFR tyrosine kinase inhibitor

J Cancer Res Clin Oncol. 2018 Aug;144(8):1435-1444. doi: 10.1007/s00432-018-2676-7. Epub 2018 May 25.

Abstract

Purpose: Acquired resistance (AR) to epidermal growth factor receptor tyrosine kinase inhibitors (EGFR-TKIs) is a major issue worldwide, for both patients and healthcare providers. However, precise prediction is currently infeasible due to the lack of an appropriate model. This study was conducted to develop and validate an individualized prediction model for automated detection of acquired EGFR-TKI resistance.

Methods: Penalized regression was applied to construct a predictive model using publically available genomic cohorts of acquired EGFR-TKI resistance. To develop a model with enhanced generalizability, we merged multiple cohorts then updated the learning parameter via robust cross-study validation. Model performance was evaluated mainly using the area under the receiver operating characteristic curve.

Results: Using a multi-study-derived machine learning method, we developed an extremely parsimonious model with generalized predictors (DDK3, CPS1, MOB3B, KRT6A), which has excellent prediction performance on blind cohorts for AR to EGFR-TKIs (gefitinib, erlotinib and afatinib) and monoclonal antibody against EGFR (cetuximab). In addition, our model also showed high performance for predicting intrinsic resistance (IR) to EGFR-TKIs from two large-scale pharmacogenomic resources, the Cancer Genome Project and the Cancer Cell Line Encyclopedia, suggesting that these general predictive features may work across AR and IR.

Conclusions: We successfully constructed a multi-study-derived prediction model for acquired EGFR-TKI resistance with excellent accuracy, generalizability and transferability.

Keywords: Computer modeling; Drug resistance; Epidermal growth factor receptor; Protein tyrosine kinases; Transcriptomics.

MeSH terms

  • Algorithms
  • Cell Line, Tumor
  • Drug Resistance, Neoplasm / genetics
  • ErbB Receptors / antagonists & inhibitors*
  • Humans
  • Machine Learning*
  • Models, Genetic*
  • Neoplasms / drug therapy*
  • Neoplasms / enzymology
  • Neoplasms / genetics*
  • Protein Kinase Inhibitors / pharmacology*
  • Reproducibility of Results
  • Transcriptome

Substances

  • Protein Kinase Inhibitors
  • EGFR protein, human
  • ErbB Receptors