A quantitative atlas of polyadenylation in five mammals

Genome Res. 2012 Jun;22(6):1173-83. doi: 10.1101/gr.132563.111. Epub 2012 Mar 27.

Abstract

We developed PolyA-seq, a strand-specific and quantitative method for high-throughput sequencing of 3' ends of polyadenylated transcripts, and used it to globally map polyadenylation (polyA) sites in 24 matched tissues in human, rhesus, dog, mouse, and rat. We show that PolyA-seq is as accurate as existing RNA sequencing (RNA-seq) approaches for digital gene expression (DGE), enabling simultaneous mapping of polyA sites and quantitative measurement of their usage. In human, we confirmed 158,533 known sites and discovered 280,857 novel sites (FDR < 2.5%). On average 10% of novel human sites were also detected in matched tissues in other species. Most novel sites represent uncharacterized alternative polyA events and extensions of known transcripts in human and mouse, but primarily delineate novel transcripts in the other three species. A total of 69.1% of known human genes that we detected have multiple polyA sites in their 3'UTRs, with 49.3% having three or more. We also detected polyadenylation of noncoding and antisense transcripts, including constitutive and tissue-specific primary microRNAs. The canonical polyA signal was strongly enriched and positionally conserved in all species. In general, usage of polyA sites is more similar within the same tissues across different species than within a species. These quantitative maps of polyA usage in evolutionarily and functionally related samples constitute a resource for understanding the regulatory mechanisms underlying alternative polyadenylation.

MeSH terms

  • 3' Untranslated Regions
  • Animals
  • Chick Embryo
  • Dogs
  • Evolution, Molecular
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Macaca mulatta / genetics
  • Mammals / genetics*
  • Mice
  • MicroRNAs / genetics
  • Models, Genetic
  • Poly A / genetics*
  • Polyadenylation / genetics*
  • RNA, Untranslated
  • Rats
  • Transcriptome

Substances

  • 3' Untranslated Regions
  • MicroRNAs
  • RNA, Untranslated
  • Poly A

Associated data

  • GEO/GSE30198