Integration of Multiple Genomic and Phenotype Data to Infer Novel miRNA-Disease Associations

PLoS One. 2016 Feb 5;11(2):e0148521. doi: 10.1371/journal.pone.0148521. eCollection 2016.

Abstract

MicroRNAs (miRNAs) play an important role in the development and progression of human diseases. The identification of disease-associated miRNAs will be helpful for understanding the molecular mechanisms of diseases at the post-transcriptional level. Based on different types of genomic data sources, computational methods for miRNA-disease association prediction have been proposed. However, individual source of genomic data tends to be incomplete and noisy; therefore, the integration of various types of genomic data for inferring reliable miRNA-disease associations is urgently needed. In this study, we present a computational framework, CHNmiRD, for identifying miRNA-disease associations by integrating multiple genomic and phenotype data, including protein-protein interaction data, gene ontology data, experimentally verified miRNA-target relationships, disease phenotype information and known miRNA-disease connections. The performance of CHNmiRD was evaluated by experimentally verified miRNA-disease associations, which achieved an area under the ROC curve (AUC) of 0.834 for 5-fold cross-validation. In particular, CHNmiRD displayed excellent performance for diseases without any known related miRNAs. The results of case studies for three human diseases (glioblastoma, myocardial infarction and type 1 diabetes) showed that all of the top 10 ranked miRNAs having no known associations with these three diseases in existing miRNA-disease databases were directly or indirectly confirmed by our latest literature mining. All these results demonstrated the reliability and efficiency of CHNmiRD, and it is anticipated that CHNmiRD will serve as a powerful bioinformatics method for mining novel disease-related miRNAs and providing a new perspective into molecular mechanisms underlying human diseases at the post-transcriptional level. CHNmiRD is freely available at http://www.bio-bigdata.com/CHNmiRD.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods
  • Gene Ontology
  • Gene Regulatory Networks
  • Genetic Predisposition to Disease*
  • Genome, Human
  • Humans
  • MicroRNAs / genetics*
  • Models, Genetic
  • Phenotype
  • Protein Interaction Maps / genetics
  • ROC Curve
  • Random Allocation
  • Reproducibility of Results

Substances

  • MicroRNAs

Grants and funding

This work was supported in part by the National Natural Science Foundation of China (Grant Nos. 31500675), the Postdoctoral Foundation of Heilongjiang Province (Grant No. LBH–Z15162), the Natural Science Foundation of Heilongjiang Province (Grant No. QC2013C019) and the Science Foundation of Heilongjiang Provincial Health Department (Grant Nos. 2013126). There was no additional external funding received for this study.