Integrating untargeted metabolomics, genetically informed causal inference, and pathway enrichment to define the obesity metabolome

Yu-Han H Hsu; Christina M Astley; Joanne B Cole; Sailaja Vedantam; Josep M Mercader; Andres Metspalu; Krista Fischer; Kristen Fortney; Eric K Morgen; Clicerio Gonzalez; Maria E Gonzalez; Tonu Esko; Joel N Hirschhorn

doi:10.1038/s41366-020-0603-x

Integrating untargeted metabolomics, genetically informed causal inference, and pathway enrichment to define the obesity metabolome

Int J Obes (Lond). 2020 Jul;44(7):1596-1606. doi: 10.1038/s41366-020-0603-x. Epub 2020 May 28.

Authors

Yu-Han H Hsu^#^{1

2

3}, Christina M Astley^#^{2

3}, Joanne B Cole^{2

3

4}, Sailaja Vedantam^{2

3}, Josep M Mercader^{3

4}, Andres Metspalu⁵, Krista Fischer^{5

6}, Kristen Fortney⁷, Eric K Morgen⁷, Clicerio Gonzalez^{8

9}, Maria E Gonzalez^{8

9}, Tonu Esko^{3

5}, Joel N Hirschhorn^{10

11

12}

Affiliations

¹ Department of Genetics, Harvard Medical School, Boston, MA, USA.
² Division of Endocrinology and Center for Basic and Translational Obesity Research, Boston Children's Hospital, Boston, MA, USA.
³ Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
⁴ Diabetes Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
⁵ Estonian Genome Center, Institute of Genomics, University of Tartu, Tartu, Estonia.
⁶ Institute of Mathematics and Statistics, University of Tartu, Tartu, Estonia.
⁷ BioAge Labs, Richmond, CA, USA.
⁸ Instituto Nacional de Salud Publica, Cuernavaca, Morelos, Mexico.
⁹ Centro de Estudios en Diabetes, Mexico City, Mexico.
¹⁰ Department of Genetics, Harvard Medical School, Boston, MA, USA. [email protected].
¹¹ Division of Endocrinology and Center for Basic and Translational Obesity Research, Boston Children's Hospital, Boston, MA, USA. [email protected].
¹² Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA. [email protected].

^# Contributed equally.

Abstract

Background: Obesity and its associated diseases are major health problems characterized by extensive metabolic disturbances. Understanding the causal connections between these phenotypes and variation in metabolite levels can uncover relevant biology and inform novel intervention strategies. Recent studies have combined metabolite profiling with genetic instrumental variable (IV) analysis (Mendelian randomization) to infer the direction of causality between metabolites and obesity, but often omitted a large portion of untargeted profiling data consisting of unknown, unidentified metabolite signals.

Methods: We expanded upon previous research by identifying body mass index (BMI)-associated metabolites in multiple untargeted metabolomics datasets, and then performing bidirectional IV analysis to classify metabolites based on their inferred causal relationships with BMI. Meta-analysis and pathway analysis of both known and unknown metabolites across datasets were enabled by our recently developed bioinformatics suite, PAIRUP-MS.

Results: We identified ten known metabolites that are more likely to be causes (e.g., alpha-hydroxybutyrate) or effects (e.g., valine) of BMI, or may have more complex bidirectional cause-effect relationships with BMI (e.g., glycine). Importantly, we also identified about five times more unknown than known metabolites in each of these three categories. Pathway analysis incorporating both known and unknown metabolites prioritized 40 enriched (p < 0.05) metabolite sets for the cause versus effect groups, providing further support that these two metabolite groups are linked to obesity via distinct biological mechanisms.

Conclusions: These findings demonstrate the potential utility of our approach to uncover causal connections with obesity from untargeted metabolomics datasets. Combining genetically informed causal inference with the ability to map unknown metabolites across datasets provides a path to jointly analyze many untargeted datasets with obesity or other phenotypes. This approach, applied to larger datasets with genotype and untargeted metabolite data, should generate sufficient power for robust discovery and replication of causal biological connections between metabolites and various human diseases.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Body Mass Index
Causality
Computational Biology
Humans
Metabolome*
Metabolomics
Obesity / genetics
Obesity / metabolism*

Abstract

Publication types

MeSH terms

Grants and funding