MHConstructor: A high-throughput, haplotype-informed solution to the MHC assembly challenge

bioRxiv [Preprint]. 2024 May 21:2024.05.20.595060. doi: 10.1101/2024.05.20.595060.

Abstract

The extremely high levels of genetic polymorphism within the human major histocompatibility complex (MHC) limit the usefulness of reference-based alignment methods for sequence assembly. We incorporate a short read de novo assembly algorithm into a workflow for novel application to the MHC. MHConstructor is a containerized pipeline designed for high-throughput, haplotype-informed, reproducible assembly of both whole genome sequencing and target-capture short read data in large, population cohorts. To-date, no other self-contained tool exists for the generation of de novo MHC assemblies from short read data. MHConstructor facilitates wide-spread access to high quality, alignment-free MHC sequence analysis.

Keywords: Major histocompatibility complex; de novo assembly; haplotype; human leukocyte antigen genes; short read sequencing.

Publication types

  • Preprint