Engineering indel and substitution variants of diverse and ancient enzymes using Graphical Representation of Ancestral Sequence Predictions (GRASP)

PLoS Comput Biol. 2022 Oct 24;18(10):e1010633. doi: 10.1371/journal.pcbi.1010633. eCollection 2022 Oct.

Abstract

Ancestral sequence reconstruction is a technique that is gaining widespread use in molecular evolution studies and protein engineering. Accurate reconstruction requires the ability to handle appropriately large numbers of sequences, as well as insertion and deletion (indel) events, but available approaches exhibit limitations. To address these limitations, we developed Graphical Representation of Ancestral Sequence Predictions (GRASP), which efficiently implements maximum likelihood methods to enable the inference of ancestors of families with more than 10,000 members. GRASP implements partial order graphs (POGs) to represent and infer insertion and deletion events across ancestors, enabling the identification of building blocks for protein engineering. To validate the capacity to engineer novel proteins from realistic data, we predicted ancestor sequences across three distinct enzyme families: glucose-methanol-choline (GMC) oxidoreductases, cytochromes P450, and dihydroxy/sugar acid dehydratases (DHAD). All tested ancestors demonstrated enzymatic activity. Our study demonstrates the ability of GRASP (1) to support large data sets over 10,000 sequences and (2) to employ insertions and deletions to identify building blocks for engineering biologically active ancestors, by exploring variation over evolutionary time.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biological Evolution
  • Evolution, Molecular*
  • INDEL Mutation* / genetics
  • Phylogeny
  • Proteins / genetics

Substances

  • Proteins

Grants and funding

This work has been supported by the Australian Research Council (ARC; arc.gov.au) Discovery Project grants 210101802 to GS, LG and MB, 160100865 to MB, EG, BK and BR and 120101772 to MB and EG. BK is an ARC Laureate Fellow (FL180100109). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.