Analysis of sequence variation underlying tissue-specific transcription factor binding and gene expression

Hum Mutat. 2013 Aug;34(8):1140-8. doi: 10.1002/humu.22343. Epub 2013 Jun 18.

Abstract

Although mutations causing monogenic disorders most frequently lie within the affected gene, sequence variation in complex disorders is more commonly found in noncoding regions. Furthermore, recent genome- wide studies have shown that common DNA sequence variants in noncoding regions are associated with "normal" variation in gene expression resulting in cell-specific and/or allele-specific differences. The mechanism by which such sequence variation causes changes in gene expression is largely unknown. We have addressed this by studying natural variation in the binding of key transcription factors (TFs) in the well-defined, purified cell system of erythropoiesis. We have shown that common polymorphisms frequently directly perturb the binding sites of key TFs, and detailed analysis shows how this causes considerable (~10-fold) changes in expression from a single allele in a tissue-specific manner. We also show how a SNP, located at some distance from the recognized TF binding site, may affect the recruitment of a large multiprotein complex and alter the associated chromatin modification of the variant regulatory element. This study illustrates the principles by which common sequence variation may cause changes in tissue-specific gene expression, and suggests that such variation may underlie an individual's propensity to develop complex human genetic diseases.

Keywords: allele specific; polymorphism; transcription factor binding; transcriptional regulation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Binding Sites / genetics
  • Erythroid Cells / metabolism*
  • Gene Expression*
  • Genetic Variation
  • Genome-Wide Association Study
  • Humans
  • Intracellular Signaling Peptides and Proteins / chemistry
  • Intracellular Signaling Peptides and Proteins / genetics*
  • Intracellular Signaling Peptides and Proteins / metabolism
  • Molecular Sequence Data
  • Nucleoside Diphosphate Kinase D / genetics*
  • Nucleoside Diphosphate Kinase D / metabolism*
  • Polymorphism, Single Nucleotide*
  • Protein Binding
  • Regulatory Sequences, Nucleic Acid
  • Transcription Factors / metabolism*

Substances

  • Intracellular Signaling Peptides and Proteins
  • STIL protein, human
  • Transcription Factors
  • NME4 protein, human
  • Nucleoside Diphosphate Kinase D