Abstract
In the genetic code, UGA serves as a stop signal and a selenocysteine codon, but no computational methods for identifying its coding function are available. Consequently, most selenoprotein genes are misannotated. We identified selenoprotein genes in sequenced mammalian genomes by methods that rely on identification of selenocysteine insertion RNA structures, the coding potential of UGA codons, and the presence of cysteine-containing homologs. The human selenoproteome consists of 25 selenoproteins.
Publication types
-
Research Support, Non-U.S. Gov't
-
Research Support, U.S. Gov't, P.H.S.
MeSH terms
-
Amino Acid Sequence
-
Animals
-
Base Sequence
-
Codon
-
Codon, Terminator
-
Computational Biology
-
DNA Transposable Elements
-
Gene Expression Profiling
-
Genome, Human
-
Humans
-
Mice
-
Molecular Sequence Data
-
Open Reading Frames
-
Proteins / chemistry*
-
Proteins / genetics*
-
Proteome*
-
Rats
-
Selenium*
-
Selenocysteine / chemistry
-
Selenocysteine / genetics*
-
Selenoproteins
-
Sequence Alignment
-
Sequence Analysis, DNA
-
Sequence Homology, Amino Acid
-
Software
Substances
-
Codon
-
Codon, Terminator
-
DNA Transposable Elements
-
Proteins
-
Proteome
-
Selenoproteins
-
Selenocysteine
-
Selenium