Multi-omics data integration and analysis pipeline for precision medicine: Systematic review

Comput Biol Chem. 2024 Dec:113:108254. doi: 10.1016/j.compbiolchem.2024.108254. Epub 2024 Oct 16.

Abstract

Precision medicine has gained considerable popularity since the "one-size-fits-all" approach did not seem very effective or reflective of the complexity of the human body. Subsequently, since single-omics does not reflect the complexity of the human body's inner workings, it did not result in the expected advancement in the medical field. Therefore, the multi-omics approach has emerged. The multi-omics approach involves integrating data from different omics technologies, such as DNA sequencing, RNA sequencing, mass spectrometry, and others, using computational methods and then analyzing the integrated result for different downstream analysis applications such as survival analysis, cancer classification, or biomarker identification. Most of the recent reviews were constrained to discussing one aspect of the multi-omics analysis pipeline, such as the dimensionality reduction step, the integration methods, or the interpretability aspect; however, very few provide a comprehensive review of every step of the analysis. This study aims to give an overview of the multi-omics analysis pipeline, starting with the most popular multi-omics databases used in recent literature, dimensionality reduction techniques, details the different types of data integration techniques and their downstream analysis applications, describes the most commonly used evaluation metrics, highlights the importance of model interpretability, and lastly discusses the challenges and potential future work for multi-omics data integration in precision medicine.

Keywords: Data integration; Dimensionality reduction; Interpretability; Machine learning; Multi-omics; Precision medicine.

Publication types

  • Systematic Review
  • Review

MeSH terms

  • Computational Biology
  • Genomics
  • Humans
  • Multiomics
  • Precision Medicine*