Divergence in DNA Specificity among Paralogous Transcription Factors Contributes to Their Differential In Vivo Binding

Cell Syst. 2018 Apr 25;6(4):470-483.e8. doi: 10.1016/j.cels.2018.02.009. Epub 2018 Mar 28.

Abstract

Paralogous transcription factors (TFs) are oftentimes reported to have identical DNA-binding motifs, despite the fact that they perform distinct regulatory functions. Differential genomic targeting by paralogous TFs is generally assumed to be due to interactions with protein co-factors or the chromatin environment. Using a computational-experimental framework called iMADS (integrative modeling and analysis of differential specificity), we show that, contrary to previous assumptions, paralogous TFs bind differently to genomic target sites even in vitro. We used iMADS to quantify, model, and analyze specificity differences between 11 TFs from 4 protein families. We found that paralogous TFs have diverged mainly at medium- and low-affinity sites, which are poorly captured by current motif models. We identify sequence and shape features differentially preferred by paralogous TFs, and we show that the intrinsic differences in specificity among paralogous TFs contribute to their differential in vivo binding. Thus, our study represents a step forward in deciphering the molecular mechanisms of differential specificity in TF families.

Keywords: E2F; ETS; RUNX; bHLH; differential protein-DNA binding; divergence in DNA specificity; genetic variation; iMADS; paralogous transcription factors.

MeSH terms

  • Binding Sites
  • Gene Expression Regulation / physiology
  • Models, Genetic*
  • Models, Molecular
  • Nucleotide Motifs
  • Sequence Analysis, Protein
  • Transcription Factors / chemistry
  • Transcription Factors / physiology*

Substances

  • Transcription Factors