Predictability of antigen binding based on short motifs in the antibody CDRH3

Lonneke Scheffer; Eric Emanuel Reber; Brij Bhushan Mehta; Milena Pavlović; Maria Chernigovskaya; Eve Richardson; Rahmad Akbar; Fridtjof Lund-Johansen; Victor Greiff; Ingrid Hobæk Haff; Geir Kjetil Sandve

doi:10.1093/bib/bbae537

Predictability of antigen binding based on short motifs in the antibody CDRH3

Brief Bioinform. 2024 Sep 23;25(6):bbae537. doi: 10.1093/bib/bbae537.

Affiliations

¹ Department of Informatics, University of Oslo, Gaustadalléen 23B, 0373 Oslo, Norway.
² Department of Immunology, University of Oslo, Sognsvannsveien 20, Rikshospitalet, 0372 Oslo, Norway.
³ La Jolla Institute for Immunology, 9420 Athena Cir, La Jolla, CA, United States.
⁴ Department of Mathematics, University of Oslo, Niels Henrik Abels hus, Moltke Moes vei 35, 0851 Oslo, Norway.

Abstract

Adaptive immune receptors, such as antibodies and T-cell receptors, recognize foreign threats with exquisite specificity. A major challenge in adaptive immunology is discovering the rules governing immune receptor-antigen binding in order to predict the antigen binding status of previously unseen immune receptors. Many studies assume that the antigen binding status of an immune receptor may be determined by the presence of a short motif in the complementarity determining region 3 (CDR3), disregarding other amino acids. To test this assumption, we present a method to discover short motifs which show high precision in predicting antigen binding and generalize well to unseen simulated and experimental data. Our analysis of a mutagenesis-based antibody dataset reveals 11 336 position-specific, mostly gapped motifs of 3-5 amino acids that retain high precision on independently generated experimental data. Using a subset of only 178 motifs, a simple classifier was made that on the independently generated dataset outperformed a deep learning model proposed specifically for such datasets. In conclusion, our findings support the notion that for some antibodies, antigen binding may be largely determined by a short CDR3 motif. As more experimental data emerge, our methodology could serve as a foundation for in-depth investigations into antigen binding signals.

Keywords: adaptive immunology; antigen binding; computational immunology; machine learning; motif discovery.

MeSH terms

Amino Acid Motifs*
Antibodies / chemistry
Antibodies / immunology
Antibodies / metabolism
Antigens* / chemistry
Antigens* / immunology
Antigens* / metabolism
Complementarity Determining Regions* / chemistry
Complementarity Determining Regions* / genetics
Complementarity Determining Regions* / immunology
Computational Biology / methods
Deep Learning
Humans
Protein Binding

Substances

Complementarity Determining Regions
Antigens
Antibodies