Characterization of genes encoding multi-domain proteins in the genome of the filamentous nitrogen-fixing Cyanobacterium anabaena sp. strain PCC 7120

DNA Res. 2001 Dec 31;8(6):271-84. doi: 10.1093/dnares/8.6.271.

Abstract

Computational analysis of gene structures in the genome of Anabaena sp. PCC 7120 revealed the presence of a large number of genes encoding proteins with multiple functional domains. This was most evident in the genes for signal transduction pathway and the related systems. Comparison of the putative amino acid sequences of the gene products with those in the Pfam database indicated that and PAS domains which may be involved in signal recognition were extremely abundant in Anabaena: 87 GAF domains in 62 ORFs and 140 PAS domains in 59 ORFs. As for the two-component signal transduction system, 73, 53, and 77 genes for simple sensory His kinases, hybrid His kinases and simple response regulators, respectively, many of which contained additional domains of diverse functions, were presumptively assigned. A total of 52 ORFs encoding putative Hanks-type Ser/Thr protein kinases with various domains such as WD-repeat, GAF and His kinase domains, as well as genes for presumptive protein phosphatases, were also identified. In addition, genes for putative transcription factors and for proteins in the cAMP signal transduction system harbored complex gene structures with multiple domains.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bacterial Proteins / genetics*
  • Cluster Analysis
  • Cyanobacteria / genetics*
  • Cyanobacteria / metabolism
  • Genes, Bacterial*
  • Genome, Bacterial
  • Multigene Family
  • Nitrogen Fixation
  • Phosphotransferases / genetics
  • Phosphotransferases / metabolism
  • Phylogeny
  • Protein Structure, Tertiary / genetics*
  • Signal Transduction

Substances

  • Bacterial Proteins
  • Phosphotransferases