FastHiC: a fast and accurate algorithm to detect long-range chromosomal interactions from Hi-C data

Bioinformatics. 2016 Sep 1;32(17):2692-5. doi: 10.1093/bioinformatics/btw240. Epub 2016 May 3.

Abstract

Motivation: How chromatin folds in three-dimensional (3D) space is closely related to transcription regulation. As powerful tools to study such 3D chromatin conformation, the recently developed Hi-C technologies enable a genome-wide measurement of pair-wise chromatin interaction. However, methods for the detection of biologically meaningful chromatin interactions, i.e. peak calling, from Hi-C data, are still under development. In our previous work, we have developed a novel hidden Markov random field (HMRF) based Bayesian method, which through explicitly modeling the non-negligible spatial dependency among adjacent pairs of loci manifesting in high resolution Hi-C data, achieves substantially improved robustness and enhanced statistical power in peak calling. Superior to peak callers that ignore spatial dependency both methodologically and in performance, our previous Bayesian framework suffers from heavy computational costs due to intensive computation incurred by modeling the correlated peak status of neighboring loci pairs and the inference of hidden dependency structure.

Results: In this work, we have developed FastHiC, a novel approach based on simulated field approximation, which approximates the joint distribution of the hidden peak status by a set of independent random variables, leading to more tractable computation. Performance comparisons in real data analysis showed that FastHiC not only speeds up our original Bayesian method by more than five times, bus also achieves higher peak calling accuracy.

Availability and implementation: FastHiC is freely accessible at:http://www.unc.edu/∼yunmli/FastHiC/ CONTACTS: : [email protected] or [email protected]

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • Chromatin*
  • Chromosome Mapping
  • Chromosomes
  • Computer Simulation
  • Genome
  • Models, Theoretical
  • Software

Substances

  • Chromatin