Spatial distribution of predicted transcription factor binding sites in Drosophila ChIP peaks

Mech Dev. 2016 Aug:141:51-61. doi: 10.1016/j.mod.2016.06.001. Epub 2016 Jun 2.

Abstract

In the development of the Drosophila embryo, gene expression is directed by the sequence-specific interactions of a large network of protein transcription factors (TFs) and DNA cis-regulatory binding sites. Once the identity of the typically 8-10bp binding sites for any given TF has been determined by one of several experimental procedures, the sequences can be represented in a position weight matrix (PWM) and used to predict the location of additional TF binding sites elsewhere in the genome. Often, alignments of large (>200bp) genomic fragments that have been experimentally determined to bind the TF of interest in Chromatin Immunoprecipitation (ChIP) studies are trimmed under the assumption that the majority of the binding sites are located near the center of all the aligned fragments. In this study, ChIP/chip datasets are analyzed using the corresponding PWMs for the well-studied TFs; CAUDAL, HUNCHBACK, KNIRPS and KRUPPEL, to determine the distribution of predicted binding sites. All four TFs are critical regulators of gene expression along the anterio-posterior axis in early Drosophila development. For all four TFs, the ChIP peaks contain multiple binding sites that are broadly distributed across the genomic region represented by the peak, regardless of the prediction stringency criteria used. This result suggests that ChIP peak trimming may exclude functional binding sites from subsequent analyses.

Keywords: Binding sites; ChIP; Drosophila; Transcription factor.

MeSH terms

  • Animals
  • Binding Sites
  • Chromatin Immunoprecipitation
  • Computational Biology
  • DNA-Binding Proteins / genetics*
  • Drosophila Proteins / genetics*
  • Drosophila melanogaster / genetics*
  • Drosophila melanogaster / growth & development
  • Gene Expression Regulation, Developmental
  • Genome, Insect / genetics
  • Homeodomain Proteins / genetics*
  • Kruppel-Like Transcription Factors / genetics*
  • Oligonucleotide Array Sequence Analysis
  • Protein Binding
  • Repressor Proteins / genetics*
  • Transcription Factors / genetics*

Substances

  • DNA-Binding Proteins
  • Drosophila Proteins
  • Homeodomain Proteins
  • Kr protein, Drosophila
  • Kruppel-Like Transcription Factors
  • Repressor Proteins
  • Transcription Factors
  • cad protein, Drosophila
  • hb protein, Drosophila
  • kni protein, Drosophila