Leveraging transcript quantification for fast computation of alternative splicing profiles

Gael P Alamancos; Amadís Pagès; Juan L Trincado; Nicolás Bellora; Eduardo Eyras

doi:10.1261/rna.051557.115

Leveraging transcript quantification for fast computation of alternative splicing profiles

RNA. 2015 Sep;21(9):1521-31. doi: 10.1261/rna.051557.115. Epub 2015 Jul 15.

Authors

Gael P Alamancos¹, Amadís Pagès², Juan L Trincado¹, Nicolás Bellora³, Eduardo Eyras⁴

Affiliations

¹ Universitat Pompeu Fabra, E08003 Barcelona, Spain.
² Universitat Pompeu Fabra, E08003 Barcelona, Spain Centre for Genomic Regulation, E08003 Barcelona, Spain.
³ INIBIOMA, CONICET-UNComahue, Bariloche, 8400 Río Negro, Argentina.
⁴ Universitat Pompeu Fabra, E08003 Barcelona, Spain Catalan Institution for Research and Advanced Studies, E08010 Barcelona, Spain.

Abstract

Alternative splicing plays an essential role in many cellular processes and bears major relevance in the understanding of multiple diseases, including cancer. High-throughput RNA sequencing allows genome-wide analyses of splicing across multiple conditions. However, the increasing number of available data sets represents a major challenge in terms of computation time and storage requirements. We describe SUPPA, a computational tool to calculate relative inclusion values of alternative splicing events, exploiting fast transcript quantification. SUPPA accuracy is comparable and sometimes superior to standard methods using simulated as well as real RNA-sequencing data compared with experimentally validated events. We assess the variability in terms of the choice of annotation and provide evidence that using complete transcripts rather than more transcripts per gene provides better estimates. Moreover, SUPPA coupled with de novo transcript reconstruction methods does not achieve accuracies as high as using quantification of known transcripts, but remains comparable to existing methods. Finally, we show that SUPPA is more than 1000 times faster than standard methods. Coupled with fast transcript quantification, SUPPA provides inclusion values at a much higher speed than existing methods without compromising accuracy, thereby facilitating the systematic splicing analysis of large data sets with limited computational resources. The software is implemented in Python 2.7 and is available under the MIT license at https://bitbucket.org/regulatorygenomicsupf/suppa.

Keywords: RNA-seq; splicing; splicing event.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Alternative Splicing*
Animals
Computational Biology / methods*
Computer Simulation
Gene Expression Profiling / methods*
Humans
RNA / metabolism*
Sequence Analysis, RNA
Software

Substances

RNA