Mining emerging biomedical literature for understanding disease associations in drug discovery

Methods Mol Biol. 2014:1159:171-206. doi: 10.1007/978-1-4939-0709-0_11.

Abstract

Systematically evaluating the exponentially growing body of scientific literature has become a critical task that every drug discovery organization must engage in in order to understand emerging trends for scientific investment and strategy development. Developing trends analysis uses the number of publications within a 3-year window to determine concepts derived from well-established disease and gene ontologies to aid in recognizing and predicting emerging areas of scientific discoveries relevant to that space. In this chapter, we describe such a method and use obesity and psoriasis as use-case examples by analyzing the frequency of disease-related MeSH terms in PubMed abstracts over time. We share how our system can be used to predict emerging trends at a relatively early stage and we analyze the literature-identified genes for genetic associations, druggability, and biological pathways to explore any potential biological connections between the two diseases that could be utilized for drug discovery.

Publication types

  • Review

MeSH terms

  • Animals
  • Data Mining / methods*
  • Drug Discovery / methods*
  • Gene Ontology*
  • Humans
  • Knowledge Discovery / methods*
  • MEDLINE
  • Medical Subject Headings