Multi-omics data integration using ratio-based quantitative profiling with Quartet reference materials

Nat Biotechnol. 2024 Jul;42(7):1133-1149. doi: 10.1038/s41587-023-01934-1. Epub 2023 Sep 7.

Abstract

Characterization and integration of the genome, epigenome, transcriptome, proteome and metabolome of different datasets is difficult owing to a lack of ground truth. Here we develop and characterize suites of publicly available multi-omics reference materials of matched DNA, RNA, protein and metabolites derived from immortalized cell lines from a family quartet of parents and monozygotic twin daughters. These references provide built-in truth defined by relationships among the family members and the information flow from DNA to RNA to protein. We demonstrate how using a ratio-based profiling approach that scales the absolute feature values of a study sample relative to those of a concurrently measured common reference sample produces reproducible and comparable data suitable for integration across batches, labs, platforms and omics types. Our study identifies reference-free 'absolute' feature quantification as the root cause of irreproducibility in multi-omics measurement and data integration and establishes the advantages of ratio-based multi-omics profiling with common reference materials.

MeSH terms

  • Gene Expression Profiling / methods
  • Genomics / methods
  • Humans
  • Metabolomics* / methods
  • Multiomics
  • Proteomics / methods
  • Reference Standards
  • Transcriptome / genetics