Evolution of substrate recognition across a multigene family of glycosyltransferases in Arabidopsis

Glycobiology. 2003 Mar;13(3):139-45. doi: 10.1093/glycob/cwg017. Epub 2002 Nov 1.

Abstract

The complete sequence of the Arabidopsis genome enables definitive characterization of multigene families and analysis of their phylogenetic relationships. Using a consensus sequence previously defined for glycosyltransferases that use small-molecular-weight acceptors, 107 gene sequences were identified in the Arabidopsis genome and used to construct a phylogenetic tree. Screening recombinant proteins for their catalytic activities in vitro has revealed enzymes active toward physiologically important substrates, including hormones and secondary metabolites. The aim of this study has been to use the phylogenetic relationships across the entire family to explore the evolution of substrate recognition and regioselectivity of glucosylation. Hydroxycoumarins have been used as the model substrates for the analysis in which 90 sequences have been assayed and 48 sequences shown to recognize these compounds. The study has revealed activity in 6 of the 14 phylogenetic groups of the multigene family, suggesting that basic features of substrate recognition are retained across substantial evolutionary periods.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / enzymology*
  • Arabidopsis / genetics
  • Arabidopsis Proteins / genetics
  • Arabidopsis Proteins / metabolism*
  • Catalysis
  • Consensus Sequence
  • Coumarins / metabolism
  • Evolution, Molecular*
  • Genes, Plant / genetics
  • Glycosylation
  • Glycosyltransferases / genetics
  • Glycosyltransferases / metabolism*
  • Molecular Structure
  • Multigene Family*
  • Phylogeny
  • Substrate Specificity

Substances

  • Arabidopsis Proteins
  • Coumarins
  • Glycosyltransferases