Computational protein design: the Proteus software and selected applications

J Comput Chem. 2013 Oct 30;34(28):2472-84. doi: 10.1002/jcc.23418. Epub 2013 Aug 26.

Abstract

We describe an automated procedure for protein design, implemented in a flexible software package, called Proteus. System setup and calculation of an energy matrix are done with the XPLOR modeling program and its sophisticated command language, supporting several force fields and solvent models. A second program provides algorithms to search sequence space. It allows a decomposition of the system into groups, which can be combined in different ways in the energy function, for both positive and negative design. The whole procedure can be controlled by editing 2-4 scripts. Two applications consider the tyrosyl-tRNA synthetase enzyme and its successful redesign to bind both O-methyl-tyrosine and D-tyrosine. For the latter, we present Monte Carlo simulations where the D-tyrosine concentration is gradually increased, displacing L-tyrosine from the binding pocket and yielding the binding free energy difference, in good agreement with experiment. Complete redesign of the Crk SH3 domain is presented. The top 10000 sequences are all assigned to the correct fold by the SUPERFAMILY library of Hidden Markov Models. Finally, we report the acid/base behavior of the SNase protein. Sidechain protonation is treated as a form of mutation; it is then straightforward to perform constant-pH Monte Carlo simulations, which yield good agreement with experiment. Overall, the software can be used for a wide range of application, producing not only native-like sequences but also thermodynamic properties with errors that appear comparable to other current software packages.

Keywords: Monte Carlo; aminoacyl-tRNA synthetase; computer simulation; molecular recognition; protein engineering.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology*
  • Hydrogen-Ion Concentration
  • Models, Molecular
  • Molecular Dynamics Simulation
  • Monte Carlo Method
  • Protein Unfolding
  • Proteins / chemistry*
  • Proto-Oncogene Proteins c-crk / chemistry
  • Software*
  • Tyrosine / analogs & derivatives
  • Tyrosine / chemistry
  • Tyrosine / metabolism
  • Tyrosine-tRNA Ligase / chemistry
  • Tyrosine-tRNA Ligase / metabolism
  • src Homology Domains

Substances

  • Proteins
  • Proto-Oncogene Proteins c-crk
  • Tyrosine
  • Tyrosine-tRNA Ligase