A multi-modal framework improves prediction of tissue-specific gene expression from a surrogate tissue

EBioMedicine. 2024 Sep:107:105305. doi: 10.1016/j.ebiom.2024.105305. Epub 2024 Aug 23.

Abstract

Background: Tissue-specific analysis of the transcriptome is critical to elucidating the molecular basis of complex traits, but central tissues are often not accessible. We propose a methodology, Multi-mOdal-based framework to bridge the Transcriptome between PEripheral and Central tissues (MOTPEC).

Methods: Multi-modal regulatory elements in peripheral blood are incorporated as features for gene expression prediction in 48 central tissues. To demonstrate the utility, we apply it to the identification of BMI-associated genes and compare the tissue-specific results with those derived directly from surrogate blood.

Findings: MOTPEC models demonstrate superior performance compared with both baseline models in blood and existing models across the 48 central tissues. We identify a set of BMI-associated genes using the central tissue MOTPEC-predicted transcriptome data. The MOTPEC-based differential gene expression (DGE) analysis of BMI in the central tissues (including brain caudate basal ganglia and visceral omentum adipose tissue) identifies 378 genes overlapping the results from a TWAS of BMI, while only 162 overlapping genes are identified using gene expression in blood. Cellular perturbation analysis further supports the utility of MOTPEC for identifying trait-associated gene sets and narrowing the effect size divergence between peripheral blood and central tissues.

Interpretation: The MOTPEC framework improves the gene expression prediction accuracy for central tissues and enhances the identification of tissue-specific trait-associated genes.

Funding: This research is supported by the National Natural Science Foundation of China 82204118 (D.Z.), the seed funding of the Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province (2020E10004), the National Institutes of Health (NIH) Genomic Innovator Award R35HG010718 (E.R.G.), NIH/NHGRI R01HG011138 (E.R.G.), NIH/NIA R56AG068026 (E.R.G.), NIH Office of the Director U24OD035523 (E.R.G.), and NIH/NIGMS R01GM140287 (E.R.G.).

Keywords: BMI; Cross-tissue prediction; Gene expression; Multi-omics; TWAS.

MeSH terms

  • Algorithms
  • Body Mass Index
  • Computational Biology / methods
  • Gene Expression Profiling*
  • Gene Expression Regulation
  • Humans
  • Organ Specificity* / genetics
  • Transcriptome*