RNA Structural Determinants of Optimal Codons Revealed by MAGE-Seq

Cell Syst. 2016 Dec 21;3(6):563-571.e6. doi: 10.1016/j.cels.2016.11.004.

Abstract

Synonymous codon choices at the beginning of genes optimize 5' RNA structures for enhanced translation initiation, but less is known about mechanisms that drive codon optimization downstream within the gene. To understand what determines codon choices across a gene, we generated 12,726 in situ codon mutants in the Escherichia coli essential gene infA and measured their fitness by combining multiplex automated genome engineering mutagenesis with amplicon deep sequencing (MAGE-seq). Correlating predicted 5' RNA structure with fitness revealed that codons even far from the start of the gene are deleterious if they disrupt the native 5' RNA conformation. These long-range structural interactions generate context-dependent rules that constrain codon choices beyond intrinsic codon preferences. Genome-wide RNA folding predictions confirm that natural codon choices far from the start codon are optimized in part to prevent disruption of native structures near the 5' UTR. Our results shed light on natural codon distributions and should improve engineering of gene expression for synthetic biology applications.

Keywords: RNA structure; codon; codon optimization; codon usage; computational biology; molecular biology; synthetic biology; systems biology.