Discovery of Novel Gain-of-Function Mutations Guided by Structure-Based Deep Learning

ACS Synth Biol. 2020 Nov 20;9(11):2927-2935. doi: 10.1021/acssynbio.0c00345. Epub 2020 Oct 16.

Abstract

Despite the promise of deep learning accelerated protein engineering, examples of such improved proteins are scarce. Here we report that a 3D convolutional neural network trained to associate amino acids with neighboring chemical microenvironments can guide identification of novel gain-of-function mutations that are not predicted by energetics-based approaches. Amalgamation of these mutations improved protein function in vivo across three diverse proteins by at least 5-fold. Furthermore, this model provides a means to interrogate the chemical space within protein microenvironments and identify specific chemical interactions that contribute to the gain-of-function phenotypes resulting from individual mutations.

Keywords: computational protein design; machine learning; neural networks; protein engineering.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Amino Acids / genetics
  • Deep Learning
  • Gain of Function Mutation / genetics*
  • Machine Learning
  • Neural Networks, Computer
  • Protein Engineering / methods
  • Proteins / genetics

Substances

  • Amino Acids
  • Proteins