Exploring the structural diversity of mammalian carbohydrates ("glycospace") by statistical databank analysis

ACS Chem Biol. 2007 Oct 19;2(10):685-91. doi: 10.1021/cb700178s.

Abstract

The diversity of three major classes of mammalian carbohydrates, mainly glycolipids and O- and N-linked glycans, deposited in the databank GLYCOSCIENCES.de was subjected to statistical analyses. Size, chain length, and branching complexity were accessed and revealed that the average oligosaccharide is composed of about eight monosaccharide units. About a quarter of all oligosaccharides are strictly linear, and the remainder are branched at least once. Glucosamine, galactose, and mannose are dominating and comprise ~75% of the monosaccharides within mammalian oligosaccharide frameworks. alpha-Linked sialic acid, alpha-linked fucose, and beta-linked galactose decorate the majority of reducing termini. Glucose as the most abundant carbohydrate in mammals plays only a very minor role within these structures. Particular emphasis was placed on analyzing the way the monosaccharide units are linked within the oligomeric framework. Just 11 monosaccharide connections account for >75% of all linkages. Thus, the number of structural combinations found in nature, the part of the occupied mammalian glycospace, is much smaller than expected. As a result, a potential set of building blocks for oligosaccharide assembly is presented. This potential building block set was correlated with the accessible 3299 mammalian carbohydrate structures in the GLYCOSCIENCES.de databank. Only 36 building blocks are required to construct 75% of the 3299 mammalian oligosaccharides.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Data Interpretation, Statistical*
  • Databases, Factual*
  • Glycolipids / chemistry*
  • Humans
  • Oligosaccharides / chemistry*
  • Polysaccharides / chemistry*

Substances

  • Glycolipids
  • Oligosaccharides
  • Polysaccharides