Accurate predictions of classification biomarkers and disease status are indispensable for clinical cancer diagnosis and research. However, the robustness of conventional gene biomarkers is limited by issues with reproducibility across different measurement platforms and cohorts of patients. In this study, we collected 4775 samples from 12 different cancer datasets, which contained 4636 TCGA samples and 139 GEO samples. A new method was developed to detect miRNA-mediated subpathway activities by using directed random walk (miDRW). To calculate the activity of each miRNA-mediated subpathway, we constructed a global directed pathway network (GDPN) with genes as nodes. We then identified miRNAs with expression levels which were strongly inversely correlated with differentially expressed target genes in the GDPN. Finally, each miRNA-mediated subpathway activity was integrated with the topological information, differential levels of miRNAs and genes, expression levels of genes, and target relationships between miRNAs and genes. The results showed that the proposed method yielded a more robust and accurate overall performance compared with other existing pathway-based, miRNA-based, and gene-based classification methods. The high-frequency miRNA-mediated subpathways are more reliable in classifying samples and for selecting therapeutic strategies.
Keywords: cancer biomarker; classification; miRNA-mediated subpathway; topological information.
© 2019 The Authors. Published by FEBS Press and John Wiley & Sons Ltd.