Achieving the predictable expression of heterologous genes in a production host has proven difficult. Each heterologous gene expressed in the same host seems to elicit a different host response governed by unknown mechanisms. Historically, most studies have approached this challenge by manipulating the properties of the heterologous gene through methods like codon optimization. Here we approach this challenge from the host side. We express a set of 45 heterologous genes in the same Escherichia coli strain, using the same expression system and culture conditions. We collect a comprehensive RNAseq set to characterize the host's transcriptional response. Independent Component Analysis of the RNAseq data set reveals independently modulated gene sets (iModulons) that characterize the host response to heterologous gene expression. We relate 55% of variation of the host response to: Fear vs Greed (16.5%), Metal Homeostasis (19.0%), Respiration (6.0%), Protein folding (4.5%), and Amino acid and nucleotide biosynthesis (9.0%). If these responses can be controlled, then the success rate with predicting heterologous gene expression should increase.
Keywords: Big data; Heterologous gene expression; Host cell response; Independent component analysis; Metabolic burden; Plasmid.
Copyright © 2020 The Authors. Published by Elsevier Inc. All rights reserved.