PrenDB, a Substrate Prediction Database to Enable Biocatalytic Use of Prenyltransferases

J Biol Chem. 2017 Mar 10;292(10):4003-4021. doi: 10.1074/jbc.M116.759118. Epub 2016 Dec 22.

Abstract

Prenyltransferases of the dimethylallyltryptophan synthase (DMATS) superfamily catalyze the attachment of prenyl or prenyl-like moieties to diverse acceptor compounds. These acceptor molecules are generally aromatic in nature and mostly indole or indole-like. Their catalytic transformation represents a major skeletal diversification step in the biosynthesis of secondary metabolites, including the indole alkaloids. DMATS enzymes thus contribute significantly to the biological and pharmacological diversity of small molecule metabolites. Understanding the substrate specificity of these enzymes could create opportunities for their biocatalytic use in preparing complex synthetic scaffolds. However, there has been no framework to achieve this in a rational way. Here, we report a chemoinformatic pipeline to enable prenyltransferase substrate prediction. We systematically catalogued 32 unique prenyltransferases and 167 unique substrates to create possible reaction matrices and compiled these data into a browsable database named PrenDB. We then used a newly developed algorithm based on molecular fragmentation to automatically extract reactive chemical epitopes. The analysis of the collected data sheds light on the thus far explored substrate space of DMATS enzymes. To assess the predictive performance of our virtual reaction extraction tool, 38 potential substrates were tested as prenyl acceptors in assays with three prenyltransferases, and we were able to detect turnover in >55% of the cases. The database, PrenDB (www.kolblab.org/prendb.php), enables the prediction of potential substrates for chemoenzymatic synthesis through substructure similarity and virtual chemical transformation techniques. It aims at making prenyltransferases and their highly regio- and stereoselective reactions accessible to the research community for integration in synthetic work flows.

Keywords: bioinformatics; chemoinformatics; computer modeling; enzyme catalysis; high-throughput screening (HTS); prenyltransferases; reaction database; substrate prediction; substrate specificity.

MeSH terms

  • Algorithms
  • Alkyl and Aryl Transferases / chemistry*
  • Alkyl and Aryl Transferases / metabolism*
  • Biocatalysis
  • Crystallography, X-Ray
  • Databases, Protein*
  • Indoles / chemistry
  • Indoles / metabolism*
  • Protein Prenylation*
  • Substrate Specificity

Substances

  • Indoles
  • indole
  • Alkyl and Aryl Transferases
  • tryptophan dimethylallyltransferase

Associated data

  • PDB/3I4X
  • PDB/3O2K
  • PDB/4E0U