A map of the cis-regulatory sequences in the mouse genome

Nature. 2012 Aug 2;488(7409):116-20. doi: 10.1038/nature11243.

Abstract

The laboratory mouse is the most widely used mammalian model organism in biomedical research. The 2.6 × 10(9) bases of the mouse genome possess a high degree of conservation with the human genome, so a thorough annotation of the mouse genome will be of significant value to understanding the function of the human genome. So far, most of the functional sequences in the mouse genome have yet to be found, and the cis-regulatory sequences in particular are still poorly annotated. Comparative genomics has been a powerful tool for the discovery of these sequences, but on its own it cannot resolve their temporal and spatial functions. Recently, ChIP-Seq has been developed to identify cis-regulatory elements in the genomes of several organisms including humans, Drosophila melanogaster and Caenorhabditis elegans. Here we apply the same experimental approach to a diverse set of 19 tissues and cell types in the mouse to produce a map of nearly 300,000 murine cis-regulatory sequences. The annotated sequences add up to 11% of the mouse genome, and include more than 70% of conserved non-coding sequences. We define tissue-specific enhancers and identify potential transcription factors regulating gene expression in each tissue or cell type. Finally, we show that much of the mouse genome is organized into domains of coordinately regulated enhancers and promoters. Our results provide a resource for the annotation of functional elements in the mammalian genome and for the study of mechanisms regulating tissue-specific gene expression.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acetylation
  • Animals
  • Chromatin / metabolism
  • Chromatin Immunoprecipitation
  • Conserved Sequence
  • Enhancer Elements, Genetic / genetics
  • Evolution, Molecular
  • Gene Expression Regulation / genetics*
  • Genome / genetics*
  • Male
  • Methylation
  • Mice / genetics*
  • Mice, Inbred C57BL
  • Molecular Sequence Annotation
  • Nucleotide Motifs
  • Organ Specificity
  • Physical Chromosome Mapping*
  • Promoter Regions, Genetic / genetics
  • Regulatory Sequences, Nucleic Acid / genetics*
  • Sequence Analysis, DNA
  • Transcription Factors / metabolism

Substances

  • Chromatin
  • Transcription Factors

Associated data

  • GEO/GSE29184