Identification of biologically relevant enhancers in human erythroid cells

J Biol Chem. 2013 Mar 22;288(12):8433-8444. doi: 10.1074/jbc.M112.413260. Epub 2013 Jan 22.

Abstract

Identification of cell type-specific enhancers is important for understanding the regulation of programs controlling cellular development and differentiation. Enhancers are typically marked by the co-transcriptional activator protein p300 or by groups of cell-expressed transcription factors. We hypothesized that a unique set of enhancers regulates gene expression in human erythroid cells, a highly specialized cell type evolved to provide adequate amounts of oxygen throughout the body. Using chromatin immunoprecipitation followed by massively parallel sequencing, genome-wide maps of candidate enhancers were constructed for p300 and four transcription factors, GATA1, NF-E2, KLF1, and SCL, using primary human erythroid cells. These data were combined with gene expression analyses, and candidate enhancers were identified. Consistent with their predicted function as candidate enhancers, there was statistically significant enrichment of p300 and combinations of co-localizing erythroid transcription factors within 1-50 kb of the transcriptional start site (TSS) of genes highly expressed in erythroid cells. Candidate enhancers were also enriched near genes with known erythroid cell function or phenotype. Candidate enhancers exhibited moderate conservation with mouse and minimal conservation with nonplacental vertebrates. Candidate enhancers were mapped to a set of erythroid-associated, biologically relevant, SNPs from the genome-wide association studies (GWAS) catalogue of NHGRI, National Institutes of Health. Fourteen candidate enhancers, representing 10 genetic loci, mapped to sites associated with biologically relevant erythroid traits. Fragments from these loci directed statistically significant expression in reporter gene assays. Identification of enhancers in human erythroid cells will allow a better understanding of erythroid cell development, differentiation, structure, and function and provide insights into inherited and acquired hematologic disease.

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Base Sequence
  • Basic Helix-Loop-Helix Transcription Factors / metabolism
  • Basic Helix-Loop-Helix Transcription Factors / physiology
  • Cells, Cultured
  • Chromatin / genetics
  • Chromatin / metabolism
  • Chromatin Immunoprecipitation
  • Conserved Sequence
  • E1A-Associated p300 Protein / metabolism
  • Enhancer Elements, Genetic*
  • Erythroid Cells / metabolism*
  • GATA1 Transcription Factor / metabolism
  • GATA1 Transcription Factor / physiology
  • Gene Expression Regulation*
  • Genes, Reporter
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Kruppel-Like Transcription Factors / metabolism
  • Kruppel-Like Transcription Factors / physiology
  • Luciferases, Firefly / biosynthesis
  • Luciferases, Firefly / genetics
  • Molecular Sequence Annotation
  • NF-E2 Transcription Factor, p45 Subunit / metabolism
  • NF-E2 Transcription Factor, p45 Subunit / physiology
  • Oligonucleotide Array Sequence Analysis
  • Polymorphism, Single Nucleotide
  • Promoter Regions, Genetic
  • Protein Binding
  • Proto-Oncogene Proteins / metabolism
  • Proto-Oncogene Proteins / physiology
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Sequence Analysis, DNA
  • T-Cell Acute Lymphocytic Leukemia Protein 1
  • Transcriptome

Substances

  • Basic Helix-Loop-Helix Transcription Factors
  • Chromatin
  • GATA1 Transcription Factor
  • GATA1 protein, human
  • Kruppel-Like Transcription Factors
  • NF-E2 Transcription Factor, p45 Subunit
  • NFE2 protein, human
  • Proto-Oncogene Proteins
  • RNA, Messenger
  • T-Cell Acute Lymphocytic Leukemia Protein 1
  • erythroid Kruppel-like factor
  • TAL1 protein, human
  • Luciferases, Firefly
  • E1A-Associated p300 Protein
  • EP300 protein, human