Discovery of directional and nondirectional pioneer transcription factors by modeling DNase profile magnitude and shape

Nat Biotechnol. 2014 Feb;32(2):171-178. doi: 10.1038/nbt.2798. Epub 2014 Jan 19.

Abstract

We describe protein interaction quantitation (PIQ), a computational method for modeling the magnitude and shape of genome-wide DNase I hypersensitivity profiles to identify transcription factor (TF) binding sites. Through the use of machine-learning techniques, PIQ identified binding sites for >700 TFs from one DNase I hypersensitivity analysis followed by sequencing (DNase-seq) experiment with accuracy comparable to that of chromatin immunoprecipitation followed by sequencing (ChIP-seq). We applied PIQ to analyze DNase-seq data from mouse embryonic stem cells differentiating into prepancreatic and intestinal endoderm. We identified 120 and experimentally validated eight 'pioneer' TF families that dynamically open chromatin. Four pioneer TF families only opened chromatin in one direction from their motifs. Furthermore, we identified 'settler' TFs whose genomic binding is principally governed by proximity to open chromatin. Our results support a model of hierarchical TF binding in which directional and nondirectional pioneer activity shapes the chromatin landscape for population by settler TFs.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites / genetics*
  • Chromatin
  • Computational Biology / methods*
  • Deoxyribonucleases / chemistry
  • Deoxyribonucleases / genetics*
  • Deoxyribonucleases / metabolism
  • Models, Genetic*
  • Protein Binding
  • Transcription Factors / chemistry
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism

Substances

  • Chromatin
  • Transcription Factors
  • Deoxyribonucleases

Associated data

  • GEO/GSE53776