RWCFusion: identifying phenotype-specific cancer driver gene fusions based on fusion pair random walk scoring method

Oncotarget. 2016 Sep 20;7(38):61054-61068. doi: 10.18632/oncotarget.11064.

Abstract

While gene fusions have been increasingly detected by next-generation sequencing (NGS) technologies based methods in human cancers, these methods have limitations in identifying driver fusions. In addition, the existing methods to identify driver gene fusions ignored the specificity among different cancers or only considered their local rather than global topology features in networks. Here, we proposed a novel network-based method, called RWCFusion, to identify phenotype-specific cancer driver gene fusions. To evaluate its performance, we used leave-one-out cross-validation in 35 cancers and achieved a high AUC value 0.925 for overall cancers and an average 0.929 for signal cancer. Furthermore, we classified 35 cancers into two classes: haematological and solid, of which the haematological got a highly AUC which is up to 0.968. Finally, we applied RWCFusion to breast cancer and found that top 13 gene fusions, such as BCAS3-BCAS4, NOTCH-NUP214, MED13-BCAS3 and CARM-SMARCA4, have been previously proved to be drivers for breast cancer. Additionally, 8 among the top 10 of the remaining candidate gene fusions, such as SULF2-ZNF217, MED1-ACSF2, and ACACA-STAC2, were inferred to be potential driver gene fusions of breast cancer by us.

Keywords: cancer; driver; gene fusion; network.

MeSH terms

  • Area Under Curve
  • Breast Neoplasms / genetics*
  • Cell Line, Tumor
  • Female
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic*
  • Gene Fusion*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Oncogene Proteins, Fusion / genetics
  • Phenotype
  • Protein Interaction Mapping
  • Sequence Analysis, RNA

Substances

  • Oncogene Proteins, Fusion