Prediction of regulatory targets of alternative isoforms of the epidermal growth factor receptor in a glioblastoma cell line

BMC Bioinformatics. 2019 Aug 22;20(1):434. doi: 10.1186/s12859-019-2944-9.

Abstract

Background: The epidermal growth factor receptor (EGFR) is a major regulator of proliferation in tumor cells. Elevated expression levels of EGFR are associated with prognosis and clinical outcomes of patients in a variety of tumor types. There are at least four splice variants of the mRNA encoding four protein isoforms of EGFR in humans, named I through IV. EGFR isoform I is the full-length protein, whereas isoforms II-IV are shorter protein isoforms. Nevertheless, all EGFR isoforms bind the epidermal growth factor (EGF). Although EGFR is an essential target of long-established and successful tumor therapeutics, the exact function and biomarker potential of alternative EGFR isoforms II-IV are unclear, motivating more in-depth analyses. Hence, we analyzed transcriptome data from glioblastoma cell line SF767 to predict target genes regulated by EGFR isoforms II-IV, but not by EGFR isoform I nor other receptors such as HER2, HER3, or HER4.

Results: We analyzed the differential expression of potential target genes in a glioblastoma cell line in two nested RNAi experimental conditions and one negative control, contrasting expression with EGF stimulation against expression without EGF stimulation. In one RNAi experiment, we selectively knocked down EGFR splice variant I, while in the other we knocked down all four EGFR splice variants, so the associated effects of EGFR II-IV knock-down can only be inferred indirectly. For this type of nested experimental design, we developed a two-step bioinformatics approach based on the Bayesian Information Criterion for predicting putative target genes of EGFR isoforms II-IV. Finally, we experimentally validated a set of six putative target genes, and we found that qPCR validations confirmed the predictions in all cases.

Conclusions: By performing RNAi experiments for three poorly investigated EGFR isoforms, we were able to successfully predict 1140 putative target genes specifically regulated by EGFR isoforms II-IV using the developed Bayesian Gene Selection Criterion (BGSC) approach. This approach is easily utilizable for the analysis of data of other nested experimental designs, and we provide an implementation in R that is easily adaptable to similar data or experimental designs together with all raw datasets used in this study in the BGSC repository, https://github.com/GrosseLab/BGSC .

Keywords: Bayesian Gene Selection Criterion; Bayesian Information Criterion; EGFR; RNAi; Splice variants.

MeSH terms

  • Alternative Splicing / genetics*
  • Bayes Theorem
  • Cell Line, Tumor
  • Computational Biology / methods*
  • ErbB Receptors / genetics*
  • ErbB Receptors / metabolism
  • Glioblastoma / genetics*
  • Humans
  • Probability
  • Protein Isoforms / genetics
  • Protein Isoforms / metabolism
  • RNA Interference
  • RNA, Small Interfering / metabolism
  • Signal Transduction

Substances

  • Protein Isoforms
  • RNA, Small Interfering
  • ErbB Receptors