Cloud-based archived metabolomics data: A resource for in-source fragmentation/annotation, meta-analysis and systems biology

Anal Sci Adv. 2020 Jun;1(1):70-80. doi: 10.1002/ansa.202000042. Epub 2020 Jun 13.

Abstract

Archived metabolomics data represent a broad resource for the scientific community. However, the absence of tools for the meta-analysis of heterogeneous data types makes it challenging to perform direct comparisons in a single and cohesive workflow. Here we present a framework for the meta-analysis of metabolic pathways and interpretation with proteomic and transcriptomic data. This framework facilitates the comparison of heterogeneous types of metabolomics data from online repositories (e.g., XCMS Online, Metabolomics Workbench, GNPS, and MetaboLights) representing tens of thousands of studies, as well as locally acquired data. As a proof of concept, we apply the workflow for the meta-analysis of i) independent colon cancer studies, further interpreted with proteomics and transcriptomics data, ii) multimodal data from Alzheimer's disease and mild cognitive impairment studies, demonstrating its high-throughput capability for the systems level interpretation of metabolic pathways. Moreover, the platform has been modified for improved knowledge dissemination through a collaboration with Metabolomics Workbench and LIPID MAPS. We envision that this meta-analysis tool will help overcome the primary bottleneck in analyzing diverse datasets and facilitate the full exploitation of archival metabolomics data for addressing a broad array of questions in metabolism research and systems biology.

Keywords: Archived data; meta-analysis; metabolic pathways; metabolomics; proteomics; systems biology; transcriptomics.