Exon-level estimates improve the detection of differentially expressed genes in RNA-seq studies

RNA Biol. 2021 Nov;18(11):1739-1746. doi: 10.1080/15476286.2020.1868151. Epub 2021 Jan 30.

Abstract

Detection of differentially expressed genes (DEGs) between different biological conditions is a key data analysis step of most RNA-sequencing studies. Conventionally, computational tools have used gene-level read counts as input to test for differential gene expression between sample condition groups. Recently, it has been suggested that statistical testing could be performed with increased power at a lower feature level prior to aggregating the results to the gene level. In this study, we systematically compared the performance of calling the DEGs when using read count data at different levels (gene, transcript, and exon) as input, in the context of two publicly available data sets. Additionally, we tested two different methods for aggregating the lower feature-level p-values to gene-level: Lancaster and empirical Brown's method. Our results show that detection of DEGs is improved compared to the conventional gene-level approach regardless of the lower feature-level used for statistical testing. The overall best balance between accuracy and false discovery rate was obtained using the exon-level approach with empirical Brown's aggregation method, which we provide as a freely available Bioconductor package EBSEA (https://bioconductor.org/packages/release/bioc/html/EBSEA.html).

Keywords: RNA-seq; differentially expressed genes; exon-level read counts; gene-level read counts; transcript compatibility counts; transcript-level read counts.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers, Tumor / genetics
  • Biomarkers, Tumor / metabolism*
  • Exome Sequencing / methods*
  • Exons*
  • Gene Expression Regulation, Neoplastic*
  • Humans
  • Male
  • Prostatic Neoplasms / genetics*
  • Prostatic Neoplasms / pathology
  • RNA-Seq / methods*
  • Software*
  • Transcriptome

Substances

  • Biomarkers, Tumor

Grants and funding

Prof. Elo reports grants from the European Research Council ERC (677943), European Union’s Horizon 2020 research and innovation programme (675395), Academy of Finland (296801, 304995, 310561, 314443, and 329278), Juvenile Diabetes Research Foundation JDRF (2-2013-32), and Sigrid Juselius Foundation, during the conduct of the study. Our research is also supported by the University of Turku, Åbo Akademi University, Turku Graduate School (UTUGS), Biocenter Finland, and ELIXIR Finland.