An algorithm for identification of bacterial selenocysteine insertion sequence elements and selenoprotein genes

Bioinformatics. 2005 Jun 1;21(11):2580-9. doi: 10.1093/bioinformatics/bti400. Epub 2005 Mar 29.

Abstract

Motivation: Incorporation of selenocysteine (Sec) into proteins in response to UGA codons requires a cis-acting RNA structure, Sec insertion sequence (SECIS) element. Whereas SECIS elements in Escherichia coli are well characterized, a bacterial SECIS consensus structure is lacking.

Results: We developed a bacterial SECIS consensus model, the key feature of which is a conserved guanosine in a small apical loop of the properly positioned structure. This consensus was used to build a computational tool, bSECISearch, for detection of bacterial SECIS elements and selenoprotein genes in sequence databases. The program identified 96.5% of known selenoprotein genes in completely sequenced bacterial genomes and predicted several new selenoprotein genes. Further analysis revealed that the size of bacterial selenoproteomes varied from 1 to 11 selenoproteins. Formate dehydrogenase was present in most selenoproteomes, often as the only selenoprotein family, whereas the occurrence of other selenoproteins was limited. The availability of the bacterial SECIS consensus and the tool for identification of these structures should help in correct annotation of selenoprotein genes and characterization of bacterial selenoproteomes.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms*
  • Bacterial Proteins / genetics*
  • Chromosome Mapping / methods*
  • DNA Transposable Elements / genetics*
  • DNA, Bacterial / genetics*
  • Proteins / genetics*
  • Selenoproteins
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Sequence Homology, Nucleic Acid
  • Software

Substances

  • Bacterial Proteins
  • DNA Transposable Elements
  • DNA, Bacterial
  • Proteins
  • Selenoproteins