Probabilistic partitioning methods to find significant patterns in ChIP-Seq data

Bioinformatics. 2014 Sep 1;30(17):2406-13. doi: 10.1093/bioinformatics/btu318. Epub 2014 May 7.

Abstract

Motivation: We have witnessed an enormous increase in ChIP-Seq data for histone modifications in the past few years. Discovering significant patterns in these data is an important problem for understanding biological mechanisms.

Results: We propose probabilistic partitioning methods to discover significant patterns in ChIP-Seq data. Our methods take into account signal magnitude, shape, strand orientation and shifts. We compare our methods with some current methods and demonstrate significant improvements, especially with sparse data. Besides pattern discovery and classification, probabilistic partitioning can serve other purposes in ChIP-Seq data analysis. Specifically, we exemplify its merits in the context of peak finding and partitioning of nucleosome positioning patterns in human promoters.

Availability and implementation: The software and code are available in the supplementary material.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chromatin Immunoprecipitation / methods*
  • High-Throughput Nucleotide Sequencing / methods*
  • Histones / metabolism*
  • Humans
  • Nucleosomes / metabolism
  • Probability
  • Promoter Regions, Genetic
  • Sequence Analysis, DNA / methods*
  • Software

Substances

  • Histones
  • Nucleosomes