Improving gene function predictions using independent transcriptional components

Nat Commun. 2021 Mar 5;12(1):1464. doi: 10.1038/s41467-021-21671-w.

Abstract

The interpretation of high throughput sequencing data is limited by our incomplete functional understanding of coding and non-coding transcripts. Reliably predicting the function of such transcripts can overcome this limitation. Here we report the use of a consensus independent component analysis and guilt-by-association approach to predict over 23,000 functional groups comprised of over 55,000 coding and non-coding transcripts using publicly available transcriptomic profiles. We show that, compared to using Principal Component Analysis, Independent Component Analysis-derived transcriptional components enable more confident functionality predictions, improve predictions when new members are added to the gene sets, and are less affected by gene multi-functionality. Predictions generated using human or mouse transcriptomic data are made available for exploration in a publicly available web portal.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology
  • Gene Expression Profiling / methods*
  • Gene Knockout Techniques
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Mice
  • RNA, Messenger / metabolism
  • Transcriptome*

Substances

  • RNA, Messenger

Associated data

  • figshare/10.6084/m9.figshare.13265159