Regulatory potential, phyletic distribution and evolution of ancient, intracellular small-molecule-binding domains

J Mol Biol. 2001 Apr 13;307(5):1271-92. doi: 10.1006/jmbi.2001.4508.

Abstract

Central cellular functions such as metabolism, solute transport and signal transduction are regulated, in part, via binding of small molecules by specialized domains. Using sensitive methods for sequence profile analysis and protein structure comparison, we exhaustively surveyed the protein sets from completely sequenced genomes for all occurrences of 21 intracellular small-molecule-binding domains (SMBDs) that are represented in at least two of the three major divisions of life (bacteria, archaea and eukaryotes). These included previously characterized domains such as PAS, GAF, ACT and ferredoxins, as well as three newly predicted SMBDs, namely the 4-vinyl reductase (4VR) domain, the NIFX domain and the 3-histidines (3H) domain. Although there are only a limited number of different superfamilies of these ancient SMBDs, they are present in numerous distinct proteins combined with various enzymatic, transport and signal-transducing domains. Most of the SMBDs show considerable evolutionary mobility and are involved in the generation of many lineage-specific domain architectures. Frequent re-invention of analogous architectures involving functionally related, but not homologous, domains was detected, such as, fusion of different SMBDs to several types of DNA-binding domains to form diverse transcription regulators in prokaryotes and eukaryotes. This is suggestive of similar selective forces affecting the diverse SMBDs and resulting in the formation of multidomain proteins that fit a limited number of functional stereotypes. Using the "guilt by association approach", the identification of SMBDs allowed prediction of functions and mode of regulation for a variety of previously uncharacterized proteins.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Archaeal Proteins / chemistry
  • Archaeal Proteins / metabolism
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / metabolism
  • Carrier Proteins / chemistry
  • Carrier Proteins / metabolism
  • Computational Biology
  • Enzymes / chemistry
  • Enzymes / metabolism
  • Eukaryotic Cells / chemistry
  • Evolution, Molecular*
  • Genome
  • Membrane Proteins / chemistry
  • Membrane Proteins / metabolism
  • Models, Molecular
  • Molecular Sequence Data
  • Molecular Weight
  • Phylogeny*
  • Protein Binding
  • Protein Structure, Tertiary*
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Receptor, trkA*
  • Sequence Alignment
  • Signal Transduction
  • Structure-Activity Relationship
  • Transcription Factors / chemistry
  • Transcription Factors / metabolism

Substances

  • Archaeal Proteins
  • Bacterial Proteins
  • Carrier Proteins
  • Enzymes
  • Membrane Proteins
  • Proteins
  • Transcription Factors
  • Receptor, trkA