HiCLift: a fast and efficient tool for converting chromatin interaction data between genome assemblies

Bioinformatics. 2023 Jun 1;39(6):btad389. doi: 10.1093/bioinformatics/btad389.

Abstract

Motivation: With the continuous effort to improve the quality of human reference genome and the generation of more and more personal genomes, the conversion of genomic coordinates between genome assemblies is critical in many integrative and comparative studies. While tools have been developed for such task for linear genome signals such as ChIP-Seq, no tool exists to convert genome assemblies for chromatin interaction data, despite the importance of three-dimensional genome organization in gene regulation and disease.

Results: Here, we present HiCLift, a fast and efficient tool that can convert the genomic coordinates of chromatin contacts such as Hi-C and Micro-C from one assembly to another, including the latest T2T-CHM13 genome. Comparing with the strategy of directly remapping raw reads to a different genome, HiCLift runs on average 42 times faster (hours vs. days), while outputs nearly identical contact matrices. More importantly, as HiCLift does not need to remap the raw reads, it can directly convert human patient sample data, where the raw sequencing reads are sometimes hard to acquire or not available.

Availability and implementation: HiCLift is publicly available at https://github.com/XiaoTaoWang/HiCLift.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Chromatin* / genetics
  • Genome, Human
  • Genomics
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Sequence Analysis, DNA / methods
  • Software*

Substances

  • Chromatin