HLA-II immunopeptidome profiling and deep learning reveal features of antigenicity to inform antigen discovery

Immunity. 2023 Jul 11;56(7):1681-1698.e13. doi: 10.1016/j.immuni.2023.05.009. Epub 2023 Jun 9.

Abstract

CD4+ T cell responses are exquisitely antigen specific and directed toward peptide epitopes displayed by human leukocyte antigen class II (HLA-II) on antigen-presenting cells. Underrepresentation of diverse alleles in ligand databases and an incomplete understanding of factors affecting antigen presentation in vivo have limited progress in defining principles of peptide immunogenicity. Here, we employed monoallelic immunopeptidomics to identify 358,024 HLA-II binders, with a particular focus on HLA-DQ and HLA-DP. We uncovered peptide-binding patterns across a spectrum of binding affinities and enrichment of structural antigen features. These aspects underpinned the development of context-aware predictor of T cell antigens (CAPTAn), a deep learning model that predicts peptide antigens based on their affinity to HLA-II and full sequence of their source proteins. CAPTAn was instrumental in discovering prevalent T cell epitopes from bacteria in the human microbiome and a pan-variant epitope from SARS-CoV-2. Together CAPTAn and associated datasets present a resource for antigen discovery and the unraveling genetic associations of HLA alleles with immunopathologies.

Keywords: CD4(+) T cells; MHC class II; SARS-CoV-2 antigens; antigen presentation; immunopeptidomics; microbiome antigens; protein sequence models.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • COVID-19*
  • Captan
  • Deep Learning*
  • Epitopes, T-Lymphocyte
  • HLA Antigens
  • Humans
  • Peptides
  • SARS-CoV-2

Substances

  • Captan
  • HLA Antigens
  • Epitopes, T-Lymphocyte
  • Peptides