Literature mining supports a next-generation modeling approach to predict cellular byproduct secretion

Metab Eng. 2017 Jan:39:220-227. doi: 10.1016/j.ymben.2016.12.004. Epub 2016 Dec 13.

Abstract

The metabolic byproducts secreted by growing cells can be easily measured and provide a window into the state of a cell; they have been essential to the development of microbiology, cancer biology, and biotechnology. Progress in computational modeling of cells has made it possible to predict metabolic byproduct secretion with bottom-up reconstructions of metabolic networks. However, owing to a lack of data, it has not been possible to validate these predictions across a wide range of strains and conditions. Through literature mining, we were able to generate a database of Escherichia coli strains and their experimentally measured byproduct secretions. We simulated these strains in six historical genome-scale models of E. coli, and we report that the predictive power of the models has increased as they have expanded in size and scope. The latest genome-scale model of metabolism correctly predicts byproduct secretion for 35/89 (39%) of designs. The next-generation genome-scale model of metabolism and gene expression (ME-model) correctly predicts byproduct secretion for 40/89 (45%) of designs, and we show that ME-model predictions could be further improved through kinetic parameterization. We analyze the failure modes of these simulations and discuss opportunities to improve prediction of byproduct secretion.

Keywords: Constraint-based modeling; Escherichia coli; Genome-scale model; Literature mining; ME-model.

Publication types

  • Evaluation Study

MeSH terms

  • Biopolymers / metabolism*
  • Computer Simulation
  • Data Mining / methods*
  • Escherichia coli / metabolism*
  • Escherichia coli Proteins / metabolism*
  • Gene Expression Regulation, Bacterial / physiology
  • Metabolic Flux Analysis / methods*
  • Models, Biological*
  • Periodicals as Topic

Substances

  • Biopolymers
  • Escherichia coli Proteins