Genome-wide identification of regulatory DNA elements and protein-binding footprints using signatures of open chromatin in Arabidopsis

Plant Cell. 2012 Jul;24(7):2719-31. doi: 10.1105/tpc.112.098061. Epub 2012 Jul 5.

Abstract

Gene expression and regulation in eukaryotes is controlled by orchestrated binding of regulatory proteins, including both activators and repressors, to promoters and other cis-regulatory DNA elements. An increasing number of plant genomes have been sequenced; however, a similar effort to the ENCODE project, which aimed to identify all functional elements in the human genome, has yet to be initiated in plants. Here we report genome-wide high-resolution mapping of DNase I hypersensitive (DH) sites in the model plant Arabidopsis thaliana. We identified 38,290 and 41,193 DH sites in leaf and flower tissues, respectively. The DH sites were depleted of bulk nucleosomes and were tightly associated with RNA polymerase II binding sites. Approximately 90% of the binding sites of two well-characterized MADS domain transcription factors, APETALA1 and SEPALLATA3, were covered by the DH sites. We demonstrate that protein binding footprints within a specific genomic region can be revealed using the DH site data sets in combination with known or putative protein binding motifs and gene expression data sets. Thus, genome-wide DH site mapping will be an important tool for systematic identification of all cis-regulatory DNA elements in plants.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics*
  • Arabidopsis / metabolism
  • Arabidopsis Proteins / genetics*
  • Arabidopsis Proteins / metabolism
  • Binding Sites
  • Chromatin / genetics*
  • Chromosome Mapping
  • Chromosomes, Plant / genetics
  • Deoxyribonuclease I / metabolism
  • Flowers / genetics
  • Flowers / metabolism
  • Gene Expression Regulation, Plant / genetics
  • Genome, Plant / genetics*
  • Homeodomain Proteins / genetics
  • Homeodomain Proteins / metabolism
  • MADS Domain Proteins / genetics
  • MADS Domain Proteins / metabolism
  • Mutation
  • Nucleosomes / genetics
  • Nucleotide Motifs
  • Organ Specificity
  • Plant Leaves / genetics
  • Plant Leaves / metabolism
  • Protein Footprinting
  • RNA Polymerase II / genetics
  • RNA Polymerase II / metabolism
  • Regulatory Sequences, Nucleic Acid / genetics*
  • Transcription Factors / genetics
  • Transcription Factors / metabolism

Substances

  • AP1 protein, Arabidopsis
  • Arabidopsis Proteins
  • Chromatin
  • Homeodomain Proteins
  • MADS Domain Proteins
  • Nucleosomes
  • SEP3 protein, Arabidopsis
  • Transcription Factors
  • RNA Polymerase II
  • Deoxyribonuclease I

Associated data

  • GEO/GSE34318