IRWRLDA: improved random walk with restart for lncRNA-disease association prediction

Oncotarget. 2016 Sep 6;7(36):57919-57931. doi: 10.18632/oncotarget.11141.

Abstract

In recent years, accumulating evidences have shown that the dysregulations of lncRNAs are associated with a wide range of human diseases. It is necessary and feasible to analyze known lncRNA-disease associations, predict potential lncRNA-disease associations, and provide the most possible lncRNA-disease pairs for experimental validation. Considering the limitations of traditional Random Walk with Restart (RWR), the model of Improved Random Walk with Restart for LncRNA-Disease Association prediction (IRWRLDA) was developed to predict novel lncRNA-disease associations by integrating known lncRNA-disease associations, disease semantic similarity, and various lncRNA similarity measures. The novelty of IRWRLDA lies in the incorporation of lncRNA expression similarity and disease semantic similarity to set the initial probability vector of the RWR. Therefore, IRWRLDA could be applied to diseases without any known related lncRNAs. IRWRLDA significantly improved previous classical models with reliable AUCs of 0.7242 and 0.7872 in two known lncRNA-disease association datasets downloaded from the lncRNADisease database, respectively. Further case studies of colon cancer and leukemia were implemented for IRWRLDA and 60% of lncRNAs in the top 10 prediction lists have been confirmed by recent experimental reports.

Keywords: cancer; disease; lncRNAs; random walk with restart.

MeSH terms

  • Algorithms
  • Area Under Curve
  • Colonic Neoplasms / genetics*
  • Colonic Neoplasms / metabolism
  • Computational Biology / methods*
  • Computer Simulation
  • Databases, Genetic
  • Genetic Predisposition to Disease
  • Humans
  • Leukemia / genetics*
  • Leukemia / metabolism
  • Models, Statistical
  • Normal Distribution
  • Probability
  • RNA, Long Noncoding / genetics*
  • Reproducibility of Results

Substances

  • RNA, Long Noncoding