Protein engineering using variational free energy approximation

Nat Commun. 2024 Dec 1;15(1):10447. doi: 10.1038/s41467-024-54814-w.

Abstract

Engineering proteins is a challenging task requiring the exploration of a vast design space. Traditionally, this is achieved using Directed Evolution (DE), which is a laborious process. Generative deep learning, instead, can learn biological features of functional proteins from sequence and structural datasets and return novel variants. However, most models do not generate thermodynamically stable proteins, thus leading to many non-functional variants. Here we propose a model called PRotein Engineering by Variational frEe eNergy approximaTion (PREVENT), which generates stable and functional variants by learning the sequence and thermodynamic landscape of a protein. We evaluate PREVENT by designing 40 variants of the conditionally essential E. coli phosphotransferase N-acetyl-L-glutamate kinase (EcNAGK). We find 85% of the variants to be functional, with 55% of them showing similar growth rate compared to the wildtype enzyme, despite harbouring up to 9 mutations. Our results support a new approach that can significantly accelerate protein engineering.

MeSH terms

  • Escherichia coli Proteins / chemistry
  • Escherichia coli Proteins / genetics
  • Escherichia coli Proteins / metabolism
  • Escherichia coli* / genetics
  • Escherichia coli* / metabolism
  • Mutation
  • Protein Engineering* / methods
  • Thermodynamics*

Substances

  • Escherichia coli Proteins