Metabolomics is a leading frontier of systems science and biomedical innovation. However, metabolite identification in mass spectrometry (MS)-based global metabolomics investigations remains a formidable challenge. Moreover, lack of comprehensive spectral databases hinders accurate identification of compounds in global MS-based metabolomics. Creating experiment-derived metabolite spectral libraries tailored to each experiment is labor-intensive. Therefore, predicted spectral libraries could serve as a better alternative. User-friendly tools are much needed, as the currently available metabolomic analysis tools do not offer adequate provision for users to create or choose context-specific databases. Here, we introduce the MS2Compound, a metabolite identification tool, which can be used to generate a custom database of predicted spectra using the Competitive Fragmentation Modeling-ID (CFM-ID) algorithm, and identify metabolites or compounds from the generated database. The database generator can create databases of the model/context/species used in the metabolomics study. The MS2Compound is also powered with mS-score, a scoring function for matching raw fragment spectra to a predicted spectra database. We demonstrated that mS-score is robust in par with dot product and hypergeometric score in identifying metabolites using benchmarking datasets. We evaluated and highlight here the unique features of the MS2Compound by a re-analysis of a publicly available metabolomic dataset (MassIVE id: MSV000086784) for a complex traditional drug formulation called Triphala. In conclusion, we believe that the omics systems science and biomedical research and innovation community in the field of metabolomics will find the MS2Compound as a user-friendly analysis tool of choice to accelerate future metabolomic analyses.
Keywords: MS2Compound; bioinformatics; computational biology; data analysis; metabolite identification; metabolomics; systems science.