Read Mapping for Hi-C Analysis

Simon Thomas Kelly; Kugui Tanaka; Chiaki Hosaka; Satoshi Yuhara

doi:10.1007/978-1-0716-4136-1_3

Methods Mol Biol. 2025:2856:25-62. doi: 10.1007/978-1-0716-4136-1_3.

Authors

Simon Thomas Kelly¹, Kugui Tanaka², Chiaki Hosaka², Satoshi Yuhara³

Affiliations

¹ H.U. Group Research Institute, G.K./SRL Inc., Akiruno, Tokyo, Japan.
² H.U. Group Research Institute, G.K./SRL Inc., Akiruno, Tokyo, Japan.
³ H.U. Group Research Institute, G.K./SRL Inc., Akiruno, Tokyo, Japan.

PMID: 39283445
DOI: 10.1007/978-1-0716-4136-1_3

Abstract

Hi-C is a popular ligation-based technique that uses cross-linking and next-generation sequencing to detect the 3D physical structure of chromosomes within the nucleus. As an unbiased genome-wide assay based on chromosome conformation capture, it provides rich insights into chromosome structure, dynamic chromosome folding and interactions, and the regulatory state of a cell. Bioinformatics analyses of Hi-C data require dedicated protocols, both because most genome alignment tools assume that the two reads of a pair will map to the same chromosome and because the processed data take the form of large two-dimensional matrices. Here, we outline the steps needed to generate high-quality aligned Hi-C data by mapping each read separately while correcting for biases from restriction enzyme digests. We introduce our own custom open-source pipeline, which lets users choose their preferred aligner while retaining high accuracy and performance. This enables users to generate high-resolution datasets with fast turnaround and fewer unmapped reads. Finally, we discuss recent innovations in experimental and bioinformatics techniques and their applications in clinical testing for diagnostics.
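
The two ideas in the abstract (mapping each mate on its own after accounting for the restriction digest, and ending up with a two-dimensional matrix) can be illustrated with a minimal Python sketch. It is not the authors' pipeline. It assumes an MboI/DpnII digest (site GATC), for which end fill-in and blunt ligation produce the junction GATCGATC, and it assumes the mates have already been aligned independently by some aligner. The function names, the input tuple layout, and the bin size are hypothetical choices made for illustration.

```python
from collections import Counter

# Assumption: MboI/DpnII digest (recognition site GATC). After end fill-in
# and blunt-end ligation the junction reads GATCGATC. Other enzymes produce
# different junction sequences.
LIGATION_JUNCTION = "GATCGATC"
SITE_LENGTH = 4  # length of the restored restriction site to keep


def truncate_at_junction(seq, junction=LIGATION_JUNCTION, keep=SITE_LENGTH):
    """Cut a read at the first ligation junction, keeping the restored site.

    Reads that span a junction are chimeric; trimming them before alignment
    lets each mate be mapped as a single genomic fragment.
    """
    idx = seq.find(junction)
    if idx == -1:
        return seq  # no junction found: leave the read unchanged
    return seq[: idx + keep]


def bin_contacts(pairs, bin_size):
    """Count contacts between genomic bins from independently mapped mates.

    pairs: iterable of (chrom1, pos1, chrom2, pos2) with 0-based positions
    (hypothetical layout). Returns a Counter keyed by an ordered pair of
    (chrom, bin_index) tuples, i.e. a sparse symmetric contact matrix.
    """
    counts = Counter()
    for chrom1, pos1, chrom2, pos2 in pairs:
        a = (chrom1, pos1 // bin_size)
        b = (chrom2, pos2 // bin_size)
        # Store each contact once, regardless of which mate came first.
        key = (a, b) if a <= b else (b, a)
        counts[key] += 1
    return counts
```

Because mates are not required to share a chromosome, inter-chromosomal pairs are counted in the same structure as intra-chromosomal ones. A sparse counter is used here because most bin pairs in a genome-wide matrix are empty.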

Keywords: Alignment; Bioinformatics; Hi-C; Mapping.

MeSH terms

Chromatin / chemistry
Chromatin / genetics
Chromosome Mapping* / methods
Chromosomes / genetics
Computational Biology* / methods
Genomics / methods
High-Throughput Nucleotide Sequencing* / methods
Humans
Software*

Substances

Chromatin