mHapTk: a comprehensive toolkit for the analysis of DNA methylation haplotypes

Bioinformatics. 2022 Nov 15;38(22):5141-5143. doi: 10.1093/bioinformatics/btac650.

Abstract

Summary: Bisulfite sequencing remains the gold standard technique to detect DNA methylation profiles at single-nucleotide resolution. The DNA methylation status of CpG sites on the same fragment represents a discrete methylation haplotype (mHap). The mHap-level metrics were demonstrated to be promising cancer biomarkers and explain more gene expression variation than average methylation. However, most existing tools focus on average methylation and neglect mHap patterns. Here, we present mHapTk, a comprehensive python toolkit for the analysis of DNA mHap. It calculates eight mHap-level summary statistics in predefined regions or across individual CpG in a genome-wide manner. It identifies methylation haplotype blocks, in which methylations of pairwise CpGs are tightly correlated. Furthermore, mHap patterns can be visualized with the built-in functions in mHapTk or external tools such as IGV and deepTools.

Availability and implementation: https://jiantaoshi.github.io/mhaptk/index.html.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • CpG Islands
  • DNA Methylation*
  • Haplotypes
  • High-Throughput Nucleotide Sequencing* / methods
  • Sequence Analysis, DNA / methods
  • Software