Correlation between sequence conservation of the 5' untranslated region and codon usage bias in Mus musculus genes

Gene. 2001 Oct 3;276(1-2):101-5. doi: 10.1016/s0378-1119(01)00671-0.

Abstract

The codon adaptation index (CAI) values of all protein-coding sequences of the full-length cDNA libraries of Mus musculus were computed based on the RIKEN mouse full-length cDNA library. We have also computed the extent of consensus in flanking sequences of the initiator ATG codon based on the 'relative entropy' values of respective nucleotide positions (from -20 to +12 bp relative to the initiator ATG codon) for each group of genes classified by CAI values. With regard to the two nucleotides positions (-3 and +4) known to be highly conserved in Kozak's consensus sequence, a clear correlation between CAI values and relative entropy values was observed at position -3 but this was not significant at position +4, although a significant correlation was found at position -1 of the consensus sequence. Further, although no correlation was observed at any additional positions, relative entropy values were very high at positions -4, -6, and -8 in genes with high CAI values. These findings suggest that the extent of conservation in the flanking sequence of the initiator ATG codon including Kozak's consensus sequence was an important factor in modulation of the translation efficiency as well as synonymous codon usage bias particularly in highly expressed genes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5' Untranslated Regions / genetics*
  • Animals
  • Base Composition
  • Codon / genetics*
  • Conserved Sequence
  • DNA, Complementary / genetics*
  • DNA-Directed RNA Polymerases / genetics
  • Genes / genetics
  • Heat-Shock Proteins / genetics
  • Mice
  • Peptide Elongation Factors / genetics
  • Ribosomal Proteins / genetics

Substances

  • 5' Untranslated Regions
  • Codon
  • DNA, Complementary
  • Heat-Shock Proteins
  • Peptide Elongation Factors
  • Ribosomal Proteins
  • DNA-Directed RNA Polymerases