A path-based computational model for long non-coding RNA-protein interaction prediction

Genomics. 2020 Mar;112(2):1754-1760. doi: 10.1016/j.ygeno.2019.09.018. Epub 2019 Oct 19.

Abstract

Recently, lncRNAs have attracted accumulating attentions because more and more experimental researches have shown lncRNA can play critical roles in many biological processes. Predicting potential interactions between lncRNAs and proteins are key to understand the lncRNAs biological functions. But traditional biological experiments are expensive and time-consuming, network similarity methods provide a powerful solution to computationally predict lncRNA-protein interactions. In this work, a novel path-based lncRNA-protein interaction (PBLPI) prediction model is proposed by integrating protein semantic similarity, lncRNA functional similarity, known human lncRNA-protein interactions, and Gaussian interaction profile kernel similarity. PBLPI model utilizes three interlinked sub-graphs to construct a heterogeneous graph, and then infers potential lncRNA-protein interactions through depth-first search algorithm. Consequently, PBLPI achieves reliable performance in the frameworks of 5-fold cross validation (average AUC is 0.9244 and AUPR is 0.6478). In the case study, we use "Mus musculus" data to further validate the reliability of PBLPI method. It is anticipated that PBLPI would become a useful tool to identify potential lncRNA-protein interactions.

Keywords: Bioinformatics; Interaction prediction; Path-based; Protein; lncRNA.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Genomics / methods*
  • Genomics / standards
  • Humans
  • Mice
  • Protein Binding
  • RNA, Long Noncoding / genetics
  • RNA, Long Noncoding / metabolism*

Substances

  • RNA, Long Noncoding