High content of proteins containing 21st and 22nd amino acids, selenocysteine and pyrrolysine, in a symbiotic deltaproteobacterium of gutless worm Olavius algarvensis

Nucleic Acids Res. 2007;35(15):4952-63. doi: 10.1093/nar/gkm514. Epub 2007 Jul 11.

Abstract

Selenocysteine (Sec) and pyrrolysine (Pyl) are rare amino acids that are cotranslationally inserted into proteins and known as the 21st and 22nd amino acids in the genetic code. Sec and Pyl are encoded by UGA and UAG codons, respectively, which normally serve as stop signals. Herein, we report on unusually large selenoproteomes and pyrroproteomes in a symbiont metagenomic dataset of a marine gutless worm, Olavius algarvensis. We identified 99 selenoprotein genes that clustered into 30 families, including 17 new selenoprotein genes that belong to six families. In addition, several Pyl-containing proteins were identified in this dataset. Most selenoproteins and Pyl-containing proteins were present in a single deltaproteobacterium, delta1 symbiont, which contained the largest number of both selenoproteins and Pyl-containing proteins of any organism reported to date. Our data contrast with the previous observations that symbionts and host-associated bacteria either lose Sec utilization or possess a limited number of selenoproteins, and suggest that the environment in the gutless worm promotes Sec and Pyl utilization. Anaerobic conditions and consistent selenium supply might be the factors that support the use of amino acids that extend the genetic code.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Animals
  • Bacterial Proteins / chemistry*
  • Deltaproteobacteria / genetics*
  • Genome, Bacterial
  • Lysine / analogs & derivatives*
  • Lysine / analysis
  • Molecular Sequence Data
  • Oligochaeta / microbiology*
  • Proteomics
  • Selenocysteine / analysis*
  • Selenoproteins / chemistry*
  • Selenoproteins / genetics
  • Sequence Alignment
  • Symbiosis

Substances

  • Bacterial Proteins
  • Selenoproteins
  • Selenocysteine
  • pyrrolysine
  • Lysine