Chemical structure identification in metabolomics: computational modeling of experimental features

Lochana C Menikarachchi; Mai A Hamdalla; Dennis W Hill; David F Grant

doi:10.5936/csbj.201302005

Chemical structure identification in metabolomics: computational modeling of experimental features

Comput Struct Biotechnol J. 2013 Mar 1:5:e201302005. doi: 10.5936/csbj.201302005. eCollection 2013.

Authors

Lochana C Menikarachchi¹, Mai A Hamdalla², Dennis W Hill¹, David F Grant¹

Affiliations

¹ Department of Pharmaceutical Sciences, University of Connecticut, 69 N Eagleville Rd, Storrs, CT 06269, United States.
² Department of Computer Science & Engineering, University of Connecticut, 371 Fairfield Road, Unit 2155 Storrs, CT 06269, United States.

Abstract

The identification of compounds in complex mixtures remains challenging despite recent advances in analytical techniques. At present, no single method can detect and quantify the vast array of compounds that might be of potential interest in metabolomics studies. High performance liquid chromatography/mass spectrometry (HPLC/MS) is often considered the analytical method of choice for analysis of biofluids. The positive identification of an unknown involves matching at least two orthogonal HPLC/MS measurements (exact mass, retention index, drift time etc.) against an authentic standard. However, due to the limited availability of authentic standards, an alternative approach involves matching known and measured features of the unknown compound with computationally predicted features for a set of candidate compounds downloaded from a chemical database. Computationally predicted features include retention index, ECOM50 (energy required to decompose 50% of a selected precursor ion in a collision induced dissociation cell), drift time, whether the unknown compound is biological or synthetic and a collision induced dissociation (CID) spectrum. Computational predictions are used to filter the initial "bin" of candidate compounds. The final output is a ranked list of candidates that best match the known and measured features. In this mini review, we discuss cheminformatics methods underlying this database search-filter identification approach.

Keywords: HPLC; QSPR; ion mobility; mass spectrometry; metabolomics; retention index.

Publication types

Review

Grants and funding

R01 GM087714/GM/NIGMS NIH HHS/United States