The transcriptional landscape of the mammalian genome

Science. 2005 Sep 2;309(5740):1559-63. doi: 10.1126/science.1112014.

Abstract

This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions
  • Animals
  • Base Sequence
  • Conserved Sequence
  • DNA, Complementary / chemistry
  • Genome*
  • Genome, Human
  • Genomics
  • Humans
  • Mice / genetics*
  • Promoter Regions, Genetic
  • Proteins / genetics
  • RNA / chemistry
  • RNA / classification
  • RNA Splicing
  • RNA, Untranslated / chemistry
  • Regulatory Sequences, Ribonucleic Acid
  • Terminator Regions, Genetic*
  • Transcription Initiation Site*
  • Transcription, Genetic*

Substances

  • 3' Untranslated Regions
  • DNA, Complementary
  • Proteins
  • RNA, Untranslated
  • Regulatory Sequences, Ribonucleic Acid
  • RNA