From reporters to endogenous genes: the impact of the first five codons on translation efficiency in Escherichia coli

RNA Biol. 2019 Dec;16(12):1806-1816. doi: 10.1080/15476286.2019.1661213. Epub 2019 Sep 5.

Abstract

Translation initiation is a critical step in the regulation of protein synthesis, and it is subjected to different control mechanisms, such as 5' UTR secondary structure and initiation codon context, that can influence the rates at which initiation and consequentially translation occur. For some genes, translation elongation also affects the rate of protein synthesis. With a GFP library containing nearly all possible combinations of nucleotides from the 3rd to the 5th codon positions in the protein coding region of the mRNA, it was previously demonstrated that some nucleotide combinations increased GFP expression up to four orders of magnitude. While it is clear that the codon region from positions 3 to 5 can influence protein expression levels of artificial constructs, its impact on endogenous proteins is still unknown. Through bioinformatics analysis, we identified the nucleotide combinations of the GFP library in Escherichia coli genes and examined the correlation between the expected levels of translation according to the GFP data with the experimental measures of protein expression. We observed that E. coli genes were enriched with the nucleotide compositions that enhanced protein expression in the GFP library, but surprisingly, it seemed to affect the translation efficiency only marginally. Nevertheless, our data indicate that different enterobacteria present similar nucleotide composition enrichment as E. coli, suggesting an evolutionary pressure towards the conservation of short translational enhancer sequences.

Keywords: Translational ramp; bacteria; ribosome profiling; translation elongation; translational efficiency.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5' Untranslated Regions
  • Base Sequence
  • Biological Evolution
  • Codon / chemistry
  • Codon / metabolism*
  • Computational Biology / methods
  • Enhancer Elements, Genetic
  • Escherichia coli / genetics*
  • Escherichia coli / metabolism
  • Gene Expression Regulation, Bacterial*
  • Gene Library
  • Genes, Bacterial*
  • Genes, Reporter
  • Green Fluorescent Proteins / genetics
  • Green Fluorescent Proteins / metabolism
  • Nucleic Acid Conformation
  • Nucleotide Motifs
  • Open Reading Frames
  • Peptide Chain Initiation, Translational*
  • Ribosomes / genetics
  • Ribosomes / metabolism

Substances

  • 5' Untranslated Regions
  • Codon
  • Green Fluorescent Proteins

Grants and funding

This work was supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro (FAPERJ) and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES).