Splicing predictions reliably classify different types of alternative splicing

RNA. 2015 May;21(5):813-23. doi: 10.1261/rna.048769.114. Epub 2015 Mar 24.

Abstract

Alternative splicing is a key player in the creation of complex mammalian transcriptomes and its misregulation is associated with many human diseases. Multiple mRNA isoforms are generated from most human genes, a process mediated by the interplay of various RNA signature elements and trans-acting factors that guide spliceosomal assembly and intron removal. Here, we introduce a splicing predictor that evaluates hundreds of RNA features simultaneously to successfully differentiate between exons that are constitutively spliced, exons that undergo alternative 5' or 3' splice-site selection, and alternative cassette-type exons. Surprisingly, the splicing predictor did not feature strong discriminatory contributions from binding sites for known splicing regulators. Rather, the ability of an exon to be involved in one or multiple types of alternative splicing is dictated by its immediate sequence context, mainly driven by the identity of the exon's splice sites, the conservation around them, and its exon/intron architecture. Thus, the splicing behavior of human exons can be reliably predicted based on basic RNA sequence elements.

Keywords: alternative splicing; bioinformatics; splicing predictor; support vector machine.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Alternative Splicing*
  • Animals
  • Computational Biology / methods*
  • Exons
  • Genetic Code
  • Humans
  • Mammals / genetics
  • RNA Splice Sites / genetics*
  • Reproducibility of Results
  • Sequence Analysis, RNA*

Substances

  • RNA Splice Sites