Unconstrained generation of synthetic antibody-antigen structures to guide machine learning methodology for antibody specificity prediction

Philippe A Robert; Rahmad Akbar; Robert Frank; Milena Pavlović; Michael Widrich; Igor Snapkov; Andrei Slabodkin; Maria Chernigovskaya; Lonneke Scheffer; Eva Smorodina; Puneet Rawat; Brij Bhushan Mehta; Mai Ha Vu; Ingvild Frøberg Mathisen; Aurél Prósz; Krzysztof Abram; Alex Olar; Enkelejda Miho; Dag Trygve Tryslew Haug; Fridtjof Lund-Johansen; Sepp Hochreiter; Ingrid Hobæk Haff; Günter Klambauer; Geir Kjetil Sandve; Victor Greiff

doi:10.1038/s43588-022-00372-4

Nat Comput Sci. 2022 Dec;2(12):845-865. doi: 10.1038/s43588-022-00372-4. Epub 2022 Dec 19.

Authors

Philippe A Robert^#¹, Rahmad Akbar^#², Robert Frank², Milena Pavlović³, Michael Widrich⁴, Igor Snapkov², Andrei Slabodkin², Maria Chernigovskaya², Lonneke Scheffer³, Eva Smorodina², Puneet Rawat², Brij Bhushan Mehta², Mai Ha Vu⁵, Ingvild Frøberg Mathisen², Aurél Prósz⁶, Krzysztof Abram⁷, Alex Olar⁸, Enkelejda Miho^{9,10,11}, Dag Trygve Tryslew Haug⁵, Fridtjof Lund-Johansen², Sepp Hochreiter^{4,12}, Ingrid Hobæk Haff¹³, Günter Klambauer⁴, Geir Kjetil Sandve³, Victor Greiff¹⁴

Affiliations

¹ Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway.
² Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway.
³ Department of Informatics, University of Oslo, Oslo, Norway.
⁴ ELLIS Unit Linz and LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria.
⁵ Department of Linguistics and Scandinavian Studies, University of Oslo, Oslo, Norway.
⁶ Danish Cancer Society Research Center, Translational Cancer Genomics, Copenhagen, Denmark.
⁷ The Novo Nordisk Foundation Center for Biosustainability, Autoflow, DTU Biosustain and IT University of Copenhagen, Copenhagen, Denmark.
⁸ Department of Complex Systems in Physics, Eötvös Loránd University, Budapest, Hungary.
⁹ Institute of Medical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland.
¹⁰ aiNET GmbH, Basel, Switzerland.
¹¹ Swiss Institute of Bioinformatics, Lausanne, Switzerland.
¹² Institute of Advanced Research in Artificial Intelligence (IARAI), Vienna, Austria.
¹³ Department of Mathematics, University of Oslo, Oslo, Norway.
¹⁴ Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway.

^# Contributed equally.

PMID: 38177393
DOI: 10.1038/s43588-022-00372-4

Abstract

Machine learning (ML) is a key technology for accurate prediction of antibody-antigen binding. Two orthogonal problems hinder the application of ML to antibody-specificity prediction and the benchmarking thereof: the lack of a unified ML formalization of immunological antibody-specificity prediction problems and the unavailability of large-scale synthetic datasets to benchmark real-world relevant ML methods and dataset design. Here we developed the Absolut! software suite that enables parameter-based unconstrained generation of synthetic lattice-based three-dimensional antibody-antigen-binding structures with ground-truth access to conformational paratope, epitope and affinity. We formalized common immunological antibody-specificity prediction problems as ML tasks and confirmed that for both sequence- and structure-based tasks, accuracy-based rankings of ML methods trained on experimental data hold for ML methods trained on Absolut!-generated data. The Absolut! framework has the potential to enable real-world relevant development and benchmarking of ML strategies for biotherapeutics design.

MeSH terms

Antibodies*
Antibody Specificity
Antigen-Antibody Reactions*
Epitopes / chemistry
Machine Learning

Substances

Antibodies
Epitopes
