An evolutionarily conserved DNA architecture determines target specificity of the TWIST family bHLH transcription factors

Genes Dev. 2015 Mar 15;29(6):603-16. doi: 10.1101/gad.242842.114. Epub 2015 Mar 11.

Abstract

Basic helix-loop-helix (bHLH) transcription factors recognize the canonical E-box (CANNTG) to regulate gene transcription; however, given the prevalence of E-boxes in a genome, it has been puzzling how individual bHLH proteins selectively recognize E-box sequences on their targets. TWIST is a bHLH transcription factor that promotes epithelial-mesenchymal transition (EMT) during development and tumor metastasis. High-resolution mapping of TWIST occupancy in human and Drosophila genomes reveals that TWIST, but not other bHLH proteins, recognizes a unique double E-box motif with two E-boxes spaced preferentially by 5 nucleotides. Using molecular modeling and binding kinetic analyses, we found that the strict spatial configuration in the double E-box motif aligns two TWIST-E47 dimers on the same face of DNA, thus providing a high-affinity site for a highly stable intramolecular tetramer. Biochemical analyses showed that the WR domain of TWIST dimerizes to mediate tetramer formation, which is functionally required for TWIST-induced EMT. These results uncover a novel mechanism for a bHLH transcription factor to recognize a unique spatial configuration of E-boxes to achieve target specificity. The WR-WR domain interaction uncovered here sets an example of target gene specificity of a bHLH protein being controlled allosterically by a domain outside of the bHLH region.

Keywords: TWIST; WR domain; bHLH transcription factor; epithelial–mesenchymal transition.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Biological Evolution
  • Conserved Sequence
  • Drosophila / chemistry
  • Drosophila / metabolism
  • Drosophila Proteins / chemistry*
  • Drosophila Proteins / metabolism*
  • Gene Expression Regulation
  • Humans
  • Models, Molecular*
  • Protein Binding
  • Protein Stability
  • Protein Structure, Tertiary
  • Substrate Specificity
  • Twist-Related Protein 1 / chemistry*
  • Twist-Related Protein 1 / metabolism*

Substances

  • Drosophila Proteins
  • Twist-Related Protein 1