An enumerative algorithm for de novo design of proteins with diverse pocket structures

Proc Natl Acad Sci U S A. 2020 Sep 8;117(36):22135-22145. doi: 10.1073/pnas.2005412117. Epub 2020 Aug 24.

Abstract

To create new enzymes and biosensors from scratch, precise control over the structure of small-molecule binding sites is of paramount importance, but systematically designing arbitrary protein pocket shapes and sizes remains an outstanding challenge. Using the NTF2-like structural superfamily as a model system, we developed an enumerative algorithm for creating a virtually unlimited number of de novo proteins supporting diverse pocket structures. The enumerative algorithm was tested and refined through feedback from two rounds of large-scale experimental testing, involving in total the assembly of synthetic genes encoding 7,896 designs and assessment of their stability on yeast cell surface, detailed biophysical characterization of 64 designs, and crystal structures of 5 designs. The refined algorithm generates proteins that remain folded at high temperatures and exhibit more pocket diversity than naturally occurring NTF2-like proteins. We expect this approach to transform the design of small-molecule sensors and enzymes by enabling the creation of binding and active site geometries much more optimal for specific design challenges than is accessible by repurposing the limited number of naturally occurring NTF2-like proteins.

Keywords: NTF2-like proteins; high-throughput screening; protein design; protein pockets.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Binding Sites
  • Computer Simulation
  • High-Throughput Screening Assays
  • Models, Molecular
  • Nucleocytoplasmic Transport Proteins / chemistry*
  • Protein Conformation
  • Protein Engineering
  • Protein Stability

Substances

  • Nucleocytoplasmic Transport Proteins