Understanding the adaptation of Halobacterium species NRC-1 to its extreme environment through computational analysis of its genome sequence

Genome Res. 2001 Oct;11(10):1641-50. doi: 10.1101/gr.190201.

Abstract

The genome of the halophilic archaeon Halobacterium sp. NRC-1 and predicted proteome have been analyzed by computational methods and reveal characteristics relevant to life in an extreme environment distinguished by hypersalinity and high solar radiation: (1) The proteome is highly acidic, with a median pI of 4.9 and mostly lacking basic proteins. This characteristic correlates with high surface negative charge, determined through homology modeling, as the major adaptive mechanism of halophilic proteins to function in nearly saturating salinity. (2) Codon usage displays the expected GC bias in the wobble position and is consistent with a highly acidic proteome. (3) Distinct genomic domains of NRC-1 with bacterial character are apparent by whole proteome BLAST analysis, including two gene clusters coding for a bacterial-type aerobic respiratory chain. This result indicates that the capacity of halophiles for aerobic respiration may have been acquired through lateral gene transfer. (4) Two regions of the large chromosome were found with relatively lower GC composition and overrepresentation of IS elements, similar to the minichromosomes. These IS-element-rich regions of the genome may serve to exchange DNA between the three replicons and promote genome evolution. (5) GC-skew analysis showed evidence for the existence of two replication origins in the large chromosome. This finding and the occurrence of multiple chromosomes indicate a dynamic genome organization with eukaryotic character.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Adaptation, Biological / genetics*
  • Base Composition
  • Chromosome Mapping
  • Codon / genetics
  • Computational Biology / methods*
  • Genome, Bacterial*
  • Halobacterium / genetics*
  • Halobacterium / growth & development
  • Isoelectric Point
  • Multigene Family
  • Protein Structure, Tertiary / genetics
  • Proteome / genetics
  • Sequence Homology, Amino Acid

Substances

  • Codon
  • Proteome