dCaP: detecting differential binding events in multiple conditions and proteins

BMC Genomics. 2014;15 Suppl 9(Suppl 9):S12. doi: 10.1186/1471-2164-15-S9-S12. Epub 2014 Dec 8.

Abstract

Background: Current ChIP-seq studies are interested in comparing multiple epigenetic profiles across several cell types and tissues simultaneously for studying constitutive and differential regulation. Simultaneous analysis of multiple epigenetic features in many samples can gain substantial power and specificity than analyzing individual features and/or samples separately. Yet there are currently few tools can perform joint inference of constitutive and differential regulation in multi-feature-multi-condition contexts with statistical testing. Existing tools either test regulatory variation for one factor in multiple samples at a time, or for multiple factors in one or two samples. Many of them only identify binary rather than quantitative variation, which are sensitive to threshold choices.

Results: We propose a novel and powerful method called dCaP for simultaneously detecting constitutive and differential regulation of multiple epigenetic factors in multiple samples. Using simulation, we demonstrate the superior power of dCaP compared to existing methods. We then apply dCaP to two datasets from human and mouse ENCODE projects to demonstrate its utility. We show in the human dataset that the cell-type specific regulatory loci detected by dCaP are significantly enriched near genes with cell-type specific functions and disease relevance. We further show in the mouse dataset that dCaP captures genomic regions showing significant signal variations for TAL1 occupancy between two mouse erythroid cell lines. The novel TAL1 occupancy loci detected only by dCaP are highly enriched with GATA1 occupancy and differential gene expression, while those detected only by other methods are not.

Conclusions: Here, we developed a novel approach to utilize the cooperative property of proteins to detect differential binding given multivariate ChIP-seq samples to provide better power, aiming for complementing existing approaches and providing new insights in the method development in this field.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Cell Line, Tumor
  • Computational Biology / methods*
  • DNA Copy Number Variations
  • Epigenesis, Genetic*
  • Humans
  • Likelihood Functions
  • Mice
  • Protein Binding
  • Proteins / genetics
  • Proteins / metabolism*

Substances

  • Proteins