Literature mining supports a next-generation modeling approach to predict cellular byproduct secretion

Zachary A King; Edward J O'Brien; Adam M Feist; Bernhard O Palsson

doi:10.1016/j.ymben.2016.12.004

Literature mining supports a next-generation modeling approach to predict cellular byproduct secretion

Metab Eng. 2017 Jan:39:220-227. doi: 10.1016/j.ymben.2016.12.004. Epub 2016 Dec 13.

Authors

Zachary A King¹, Edward J O'Brien¹, Adam M Feist², Bernhard O Palsson³

Affiliations

¹ Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive #0412, La Jolla, CA 92093-0412, USA.
² Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive #0412, La Jolla, CA 92093-0412, USA; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, DK-2800 Kongens, Lyngby, Denmark.
³ Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive #0412, La Jolla, CA 92093-0412, USA; Department of Pediatrics, University of California, San Diego, 9500 Gilman Drive #0412, La Jolla, CA 92093-0412, USA; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, DK-2800 Kongens, Lyngby, Denmark. Electronic address: [email protected].

PMID: 27986597
DOI: 10.1016/j.ymben.2016.12.004

Abstract

The metabolic byproducts secreted by growing cells can be easily measured and provide a window into the state of a cell; they have been essential to the development of microbiology, cancer biology, and biotechnology. Progress in computational modeling of cells has made it possible to predict metabolic byproduct secretion with bottom-up reconstructions of metabolic networks. However, owing to a lack of data, it has not been possible to validate these predictions across a wide range of strains and conditions. Through literature mining, we were able to generate a database of Escherichia coli strains and their experimentally measured byproduct secretions. We simulated these strains in six historical genome-scale models of E. coli, and we report that the predictive power of the models has increased as they have expanded in size and scope. The latest genome-scale model of metabolism correctly predicts byproduct secretion for 35/89 (39%) of designs. The next-generation genome-scale model of metabolism and gene expression (ME-model) correctly predicts byproduct secretion for 40/89 (45%) of designs, and we show that ME-model predictions could be further improved through kinetic parameterization. We analyze the failure modes of these simulations and discuss opportunities to improve prediction of byproduct secretion.

Keywords: Constraint-based modeling; Escherichia coli; Genome-scale model; Literature mining; ME-model.

Publication types

Evaluation Study

MeSH terms

Biopolymers / metabolism*
Computer Simulation
Data Mining / methods*
Escherichia coli / metabolism*
Escherichia coli Proteins / metabolism*
Gene Expression Regulation, Bacterial / physiology
Metabolic Flux Analysis / methods*
Models, Biological*
Periodicals as Topic

Substances

Biopolymers
Escherichia coli Proteins