Read Mapping for Hi-C Analysis

Methods Mol Biol. 2025:2856:25-62. doi: 10.1007/978-1-0716-4136-1_3.

Abstract

Hi-C is a popular ligation-based technique to detect 3D physical chromosome structure within the nucleus using cross-linking and next-generation sequencing. As an unbiased genome-wide assay based on chromosome conformation capture, it provides rich insights into chromosome structure, dynamic chromosome folding and interactions, and the regulatory state of a cell. Bioinformatics analyses of Hi-C data require dedicated protocols as most genome alignment tools assume that both paired-end reads will map to the same chromosome, resulting in large two-dimensional matrices as processed data. Here, we outline the necessary steps to generate high-quality aligned Hi-C data by separately mapping each read while correcting for biases from restriction enzyme digests. We introduce our own custom open-source pipeline, which enables users to select an aligner of their choosing with high accuracy and performance. This enables users to generate high-resolution datasets with fast turnaround and fewer unmapped reads. Finally, we discuss recent innovations in experimental techniques, bioinformatics techniques, and their applications in clinical testing for diagnostics.

Keywords: Alignment; Bioinformatics; Hi-C; Mapping.

MeSH terms

  • Chromatin / chemistry
  • Chromatin / genetics
  • Chromosome Mapping* / methods
  • Chromosomes / genetics
  • Computational Biology* / methods
  • Genomics / methods
  • High-Throughput Nucleotide Sequencing* / methods
  • Humans
  • Software*

Substances

  • Chromatin