Widespread non-coding polymorphism in HLA class II genes of International HLA and Immunogenetics Workshop cell lines

HLA. 2022 Apr;99(4):328-356. doi: 10.1111/tan.14571. Epub 2022 Feb 14.

Abstract

As the primary genetic determinant of immune recognition of self and non-self, the hyperpolymorphic HLA genes play key roles in disease association and transplantation. The large, variably sized HLA class II genes have historically been less well characterized than the shorter HLA class I genes. Here, we have used Pacific Biosciences Single Molecule Real-Time (SMRT®) DNA sequencing to perform four-field resolution HLA typing of HLA-DRB1/3/4/5, -DQA1, -DQB1, -DPA1 and -DPB1 from a panel of 181 B-lymphoblastoid cell lines from the International HLA and Immunogenetics Workshops. By interrogating all exons, introns, and the untranslated regions of these important reference cells, we have improved their HLA typing resolution on the IPD-IMGT/HLA database. We observed widespread non-coding polymorphism, with over twice as many unique genomic sequences identified compared with coding sequences (CDS). We submitted 263 unique sequences to the IPD-IMGT/HLA Database, often from multiple cell lines, including 114 confirmations of existing alleles, of which 30 were also extensions to full-length genomic sequences where only CDS was available previously. A total of 149 novel alleles were identified, largely differing from their closest reference allele sequences by a single nucleotide polymorphism (SNP). However, some highly divergent alleles were deemed to be recombinants, only detectable by full-length sequencing with long, phased reads. The fourth-field variation we observed allowed fine mapping of linkage disequilibrium patterns and haplotypes to particular ancestries. This study has highlighted the under-appreciated non-coding diversity in HLA class II genes, with potential implications for population genetic and clinical studies.

Keywords: B-lymphoblastoid cell lines; HLA class II; International HLA and Immunogenetics Workshop; SMRT sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • Cell Line
  • Gene Frequency
  • Genes, MHC Class II*
  • Haplotypes
  • Humans
  • Immunogenetics*