Sequence conservation, relative isoform frequencies, and nonsense-mediated decay in evolutionarily conserved alternative splicing

Proc Natl Acad Sci U S A. 2005 Sep 6;102(36):12813-8. doi: 10.1073/pnas.0506139102. Epub 2005 Aug 25.

Abstract

Studies of expressed sequence tag data sets have revealed large numbers of splicing variants for human genes, but it remains challenging to distinguish functionally important variants from aberrant splicing, clarify the nature of the alternative functions, and understand the signals that regulate splicing choices. To help address these issues, we have constructed and analyzed a large data set of 1,478 exon-skipping alternative splicing (AS) variants evolutionarily conserved in human and mouse. In about one-fifth of cases, one isoform appears subject to nonsense-mediated mRNA decay (NMD), supporting the idea that a major role of AS is to regulate gene expression; one-quarter of these NMD-inducing cases involve a conserved exon whose apparent sole purpose is to mediate destruction of the message when included. We explore sequence conservation likely related to splicing regulation, using in part a measure of the overall amount of conserved information in a sequence, and find that the increased conservation that has been observed within AS exons primarily affects synonymous sites, suggesting that regulatory signals significantly constrain synonymous substitution rates. We show that a lower frequency of the inclusion isoform relative to the exclusion isoform tends to be associated with weaker splice site signals, smaller exon size, and higher intronic sequence conservation, and provide evidence that all of these factors are under selection to control relative isoform frequencies. Some conserved instances of AS appear to represent aberrant splicing events that by chance have occurred in both species, and we develop a nonparametric likelihood approach to identify these.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing / genetics*
  • Animals
  • Conserved Sequence / genetics*
  • Evolution, Molecular*
  • Exons / genetics
  • Humans
  • Linear Models
  • Protein Isoforms / genetics*
  • Proteins / genetics*
  • RNA Splice Sites / genetics
  • RNA Stability / genetics*
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism

Substances

  • Protein Isoforms
  • Proteins
  • RNA Splice Sites
  • RNA, Messenger