Score-based generative modeling for de novo protein design

Nat Comput Sci. 2023 May;3(5):382-392. doi: 10.1038/s43588-023-00440-3. Epub 2023 May 4.

Abstract

The generation of de novo protein structures with predefined functions and properties remains a challenging problem in protein design. Diffusion models, also known as score-based generative models (SGMs), have recently exhibited astounding empirical performance in image synthesis. Here we use image-based representations of protein structure to develop ProteinSGM, a score-based generative model that produces realistic de novo proteins. Through unconditional generation, we show that ProteinSGM can generate native-like protein structures, surpassing the performance of previously reported generative models. We experimentally validate some de novo designs and observe secondary structure compositions consistent with generated backbones. Finally, we apply conditional generation to de novo protein design by formulating it as an image inpainting problem, allowing precise and modular design of protein structure.
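
To make the "image-based representation" and "image inpainting" framing concrete, the following is a minimal sketch in Python with NumPy. It is an illustration under stated assumptions, not the ProteinSGM implementation. The abstract does not say which channels the model uses, so a pairwise C-alpha distance map is assumed here as one plausible image-like channel. The function names (`distance_map`, `inpainting_mask`, `apply_known_region`) and the toy coordinates are hypothetical, and the score model and sampler are omitted entirely.

```python
import numpy as np


def distance_map(ca_coords):
    """Return the (L, L) matrix of pairwise C-alpha distances.

    Assumption: ca_coords is an (L, 3) array of coordinates in angstroms.
    The result is symmetric with a zero diagonal, so it can be treated
    as a single-channel L x L image.
    """
    diff = ca_coords[:, None, :] - ca_coords[None, :, :]
    return np.linalg.norm(diff, axis=-1)


def inpainting_mask(length, fixed_residues):
    """Return a boolean (L, L) mask of 'known' pixels.

    A pixel (i, j) is known only when both residue i and residue j are
    fixed, for example the residues of a motif to be preserved.
    """
    fixed = np.zeros(length, dtype=bool)
    fixed[list(fixed_residues)] = True
    return fixed[:, None] & fixed[None, :]


def apply_known_region(sample, reference, mask):
    """Overwrite the known pixels of a sample with the reference values.

    In an inpainting-style sampler, a step like this would typically be
    repeated during generation so the fixed region stays anchored while
    the rest of the image is generated around it.
    """
    return np.where(mask, reference, sample)


# Toy backbone: 8 residues on a straight line, 3.8 angstroms apart
# (hypothetical data, chosen only so the distances are easy to check).
L = 8
coords = np.stack([3.8 * np.arange(L), np.zeros(L), np.zeros(L)], axis=1)

reference = distance_map(coords)
mask = inpainting_mask(L, range(2, 5))      # residues 2, 3, 4 are fixed
blank = np.zeros_like(reference)            # stand-in for a generated sample
conditioned = apply_known_region(blank, reference, mask)
```

In this sketch, `conditioned` matches `reference` on the 3 x 3 block for residues 2 to 4 and keeps the placeholder values everywhere else. Conditioning on a mask over residue pairs is what makes the design modular in this framing: different parts of a structure can be fixed or regenerated by changing `fixed_residues`.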

MeSH terms

  • Protein Structure, Secondary
  • Proteins* / chemistry

Substances

  • Proteins