Hi-C is a popular ligation-based technique to detect 3D physical chromosome structure within the nucleus using cross-linking and next-generation sequencing. As an unbiased genome-wide assay based on chromosome conformation capture, it provides rich insights into chromosome structure, dynamic chromosome folding and interactions, and the regulatory state of a cell. Bioinformatics analyses of Hi-C data require dedicated protocols as most genome alignment tools assume that both paired-end reads will map to the same chromosome, resulting in large two-dimensional matrices as processed data. Here, we outline the necessary steps to generate high-quality aligned Hi-C data by separately mapping each read while correcting for biases from restriction enzyme digests. We introduce our own custom open-source pipeline, which enables users to select an aligner of their choosing with high accuracy and performance. This enables users to generate high-resolution datasets with fast turnaround and fewer unmapped reads. Finally, we discuss recent innovations in experimental techniques, bioinformatics techniques, and their applications in clinical testing for diagnostics.
Keywords: Alignment; Bioinformatics; Hi-C; Mapping.
© 2025. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.