Diagnosing phenotypes of single-sample individuals by edge biomarkers

J Mol Cell Biol. 2015 Jun;7(3):231-41. doi: 10.1093/jmcb/mjv025. Epub 2015 Apr 26.

Abstract

Network or edge biomarkers are a reliable form to characterize phenotypes or diseases. However, obtaining edges or correlations between molecules for an individual requires measurement of multiple samples of that individual, which are generally unavailable in clinical practice. Thus, it is strongly demanded to diagnose a disease by edge or network biomarkers in one-sample-for-one-individual context. Here, we developed a new computational framework, EdgeBiomarker, to integrate edge and node biomarkers to diagnose phenotype of each single test sample. By applying the method to datasets of lung and breast cancer, it reveals new marker genes/gene-pairs and related sub-networks for distinguishing earlier and advanced cancer stages. Our method shows advantages over traditional methods: (i) edge biomarkers extracted from non-differentially expressed genes achieve better cross-validation accuracy of diagnosis than molecule or node biomarkers from differentially expressed genes, suggesting that certain pathogenic information is only present at the level of network and under-estimated by traditional methods; (ii) edge biomarkers categorize patients into low/high survival rate in a more reliable manner; (iii) edge biomarkers are significantly enriched in relevant biological functions or pathways, implying that the association changes in a network, rather than expression changes in individual molecules, tend to be causally related to cancer development. The new framework of edge biomarkers paves the way for diagnosing diseases and analyzing their molecular mechanisms by edges or networks in one-sample-for-one-individual basis. This also provides a powerful tool for precision medicine or big-data medicine.

Keywords: big biological data; disease diagnosis; edge biomarker; edge feature; progressive stages.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma / diagnosis*
  • Adenocarcinoma / genetics
  • Adenocarcinoma / mortality
  • Adenocarcinoma of Lung
  • Biomarkers, Tumor / genetics*
  • Breast Neoplasms / diagnosis*
  • Breast Neoplasms / genetics
  • Breast Neoplasms / mortality
  • Computational Biology
  • Early Diagnosis
  • Humans
  • Lung Neoplasms / diagnosis*
  • Lung Neoplasms / genetics
  • Lung Neoplasms / mortality
  • Neoplasm Metastasis
  • Phenotype
  • Proportional Hazards Models

Substances

  • Biomarkers, Tumor