Computationally mapping sequence space to understand evolutionary protein engineering

Biotechnol Prog. 2008 Jan-Feb;24(1):62-73. doi: 10.1021/bp070134h. Epub 2007 Nov 17.

Abstract

Evolutionary protein engineering has been dramatically successful, producing a wide variety of new proteins with altered stability, binding affinity, and enzymatic activity. However, the success of such procedures is often unreliable, and the impact of the choice of protein, engineering goal, and evolutionary procedure is not well understood. We have created a framework for understanding aspects of the protein engineering process by computationally mapping regions of feasible sequence space for three small proteins using structure-based design protocols. We then tested the ability of different evolutionary search strategies to explore these sequence spaces. The results point to a non-intuitive relationship between the error-prone PCR mutation rate and the number of rounds of replication. The evolutionary relationships among feasible sequences reveal hub-like sequences that serve as particularly fruitful starting sequences for evolutionary search. Moreover, genetic recombination procedures were examined, and tradeoffs relating sequence diversity and search efficiency were identified. This framework allows us to consider the impact of protein structure on the allowed sequence space and therefore on the challenges that each protein presents to error-prone PCR and genetic recombination procedures.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Aprotinin / chemistry
  • Bacterial Proteins / chemistry
  • Base Sequence
  • Cattle
  • Computer Simulation*
  • Models, Molecular
  • Polymerase Chain Reaction
  • Protein Engineering / methods*
  • Protein Engineering / trends
  • Protein Structure, Tertiary
  • Proteins / chemistry*

Substances

  • Bacterial Proteins
  • IgG Fc-binding protein, Streptococcus
  • Proteins
  • Aprotinin