Phylogenetic analyses suggest multiple changes of substrate specificity within the glycosyl hydrolase 20 family

BMC Evol Biol. 2008 Jul 22:8:214. doi: 10.1186/1471-2148-8-214.

Abstract

Background: Beta-N-acetylhexosaminidases belonging to the glycosyl hydrolase 20 (GH20) family are involved in the removal of terminal beta-glycosidacally linked N-acetylhexosamine residues. These enzymes, widely distributed in microorganisms, animals and plants, are involved in many important physiological and pathological processes, such as cell structural integrity, energy storage, pathogen defence, viral penetration, cellular signalling, fertilization, development of carcinomas, inflammatory events and lysosomal storage diseases. Nevertheless, only limited analyses of phylogenetic relationships between GH20 genes have been performed until now.

Results: Careful phylogenetic analyses of 233 inferred protein sequences from eukaryotes and prokaryotes reveal a complex history for the GH20 family. In bacteria, multiple gene duplications and lineage specific gene loss (and/or horizontal gene transfer) are required to explain the observed taxonomic distribution. The last common ancestor of extant eukaryotes is likely to have possessed at least one GH20 family member. At least one gene duplication before the divergence of animals, plants and fungi as well as other lineage specific duplication events have given rise to multiple paralogous subfamilies in eukaryotes. Phylogenetic analyses also suggest that a second, divergent subfamily of GH20 family genes present in animals derive from an independent prokaryotic source. Our data suggest multiple convergent changes of functional roles of GH20 family members in eukaryotes.

Conclusion: This study represents the first detailed evolutionary analysis of the glycosyl hydrolase GH20 family. Mapping of data concerning physiological function of GH20 family members onto the phylogenetic tree reveals that apparently convergent and highly lineage specific changes in substrate specificity have occurred in multiple GH20 subfamilies.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Eukaryotic Cells / classification
  • Evolution, Molecular*
  • Gene Order
  • Glycoside Hydrolases / chemistry
  • Glycoside Hydrolases / genetics*
  • Glycoside Hydrolases / metabolism*
  • Humans
  • Molecular Sequence Data
  • Phylogeny*
  • Prokaryotic Cells / classification
  • Sequence Alignment
  • Sequence Analysis, Protein
  • Substrate Specificity
  • beta-N-Acetylhexosaminidases / chemistry
  • beta-N-Acetylhexosaminidases / genetics

Substances

  • Glycoside Hydrolases
  • beta-N-Acetylhexosaminidases