Toward real-world automated antibody design with combinatorial Bayesian optimization

Cell Rep Methods. 2023 Jan 3;3(1):100374. doi: 10.1016/j.crmeth.2022.100374. eCollection 2023 Jan 23.

Abstract

Antibodies are multimeric proteins capable of highly specific molecular recognition. The complementarity determining region 3 of the antibody variable heavy chain (CDRH3) often dominates antigen-binding specificity. Hence, it is a priority to design optimal antigen-specific CDRH3 to develop therapeutic antibodies. The combinatorial structure of CDRH3 sequences makes it impossible to query binding-affinity oracles exhaustively. Moreover, antibodies are expected to have high target specificity and developability. Here, we present AntBO, a combinatorial Bayesian optimization framework utilizing a CDRH3 trust region for an in silico design of antibodies with favorable developability scores. The in silico experiments on 159 antigens demonstrate that AntBO is a step toward practically viable in vitro antibody design. In under 200 calls to the oracle, AntBO suggests antibodies outperforming the best binding sequence from 6.9 million experimentally obtained CDRH3s. Additionally, AntBO finds very-high-affinity CDRH3 in only 38 protein designs while requiring no domain knowledge.

Keywords: Bayesian optimization; Gaussian processes; combinatorial Bayesian optimization; computational antibody design; machine learning; protein engineering; structural biology.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Antibodies* / therapeutic use
  • Antigens
  • Bayes Theorem
  • Complementarity Determining Regions* / genetics
  • Immunoglobulin Heavy Chains / chemistry

Substances

  • Antibodies
  • Complementarity Determining Regions
  • Immunoglobulin Heavy Chains
  • Antigens