Predicting molecular mechanisms of hereditary diseases by using their tissue-selective manifestation

Mol Syst Biol. 2023 Aug 8;19(8):e11407. doi: 10.15252/msb.202211407. Epub 2023 May 26.

Abstract

How do aberrations in widely expressed genes lead to tissue-selective hereditary diseases? Previous attempts to answer this question were limited to testing a few candidate mechanisms. To answer this question at a larger scale, we developed "Tissue Risk Assessment of Causality by Expression" (TRACE), a machine learning approach to predict genes that underlie tissue-selective diseases and selectivity-related features. TRACE utilized 4,744 biologically interpretable tissue-specific gene features that were inferred from heterogeneous omics datasets. Application of TRACE to 1,031 disease genes uncovered known and novel selectivity-related features, the most common of which was previously overlooked. Next, we created a catalog of tissue-associated risks for 18,927 protein-coding genes (https://netbio.bgu.ac.il/trace/). As proof-of-concept, we prioritized candidate disease genes identified in 48 rare-disease patients. TRACE ranked the verified disease gene among the patient's candidate genes significantly better than gene prioritization methods that rank by gene constraint or tissue expression. Thus, tissue selectivity combined with machine learning enhances genetic and clinical understanding of hereditary diseases.

Keywords: data integration; genomic medicine; machine learning; omics; tissue selectivity.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Causality
  • Humans
  • Machine Learning*
  • Rare Diseases* / genetics
  • Risk Assessment