Data-Driven Protein Engineering for Improving Catalytic Activity and Selectivity

Chembiochem. 2024 Feb 1;25(3):e202300754. doi: 10.1002/cbic.202300754. Epub 2023 Dec 11.

Abstract

Protein engineering is essential for altering the substrate scope, catalytic activity and selectivity of enzymes for applications in biocatalysis. However, traditional approaches, such as directed evolution and rational design, encounter the challenge in dealing with the experimental screening process of a large protein mutation space. Machine learning methods allow the approximation of protein fitness landscapes and the identification of catalytic patterns using limited experimental data, thus providing a new avenue to guide protein engineering campaigns. In this concept article, we review machine learning models that have been developed to assess enzyme-substrate-catalysis performance relationships aiming to improve enzymes through data-driven protein engineering. Furthermore, we prospect the future development of this field to provide additional strategies and tools for achieving desired activities and selectivities.

Keywords: Biocatalysis; catalytic activity; machine learning; protein engineering; selectivity.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biocatalysis
  • Catalysis
  • Enzymes / genetics
  • Enzymes / metabolism
  • Mutation
  • Protein Engineering* / methods
  • Proteins* / genetics
  • Proteins* / metabolism

Substances

  • Enzymes
  • Proteins