Knowledge retrieval from PubMed abstracts and electronic medical records with the Multiple Sclerosis Ontology

PLoS One. 2015 Feb 9;10(2):e0116718. doi: 10.1371/journal.pone.0116718. eCollection 2015.

Abstract

Background: In order to retrieve useful information from scientific literature and electronic medical records (EMR) we developed an ontology specific for Multiple Sclerosis (MS).

Methods: The MS Ontology was created using scientific literature and expert review under the Protégé OWL environment. We developed a dictionary with semantic synonyms and translations to different languages for mining EMR. The MS Ontology was integrated with other ontologies and dictionaries (diseases/comorbidities, gene/protein, pathways, drug) into the text-mining tool SCAIView. We analyzed the EMRs from 624 patients with MS using the MS ontology dictionary in order to identify drug usage and comorbidities in MS. Testing competency questions and functional evaluation using F statistics further validated the usefulness of MS ontology.

Results: Validation of the lexicalized ontology by means of named entity recognition-based methods showed an adequate performance (F score = 0.73). The MS Ontology retrieved 80% of the genes associated with MS from scientific abstracts and identified additional pathways targeted by approved disease-modifying drugs (e.g. apoptosis pathways associated with mitoxantrone, rituximab and fingolimod). The analysis of the EMR from patients with MS identified current usage of disease modifying drugs and symptomatic therapy as well as comorbidities, which are in agreement with recent reports.

Conclusion: The MS Ontology provides a semantic framework that is able to automatically extract information from both scientific literature and EMR from patients with MS, revealing new pathogenesis insights as well as new clinical information.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Antineoplastic Agents / therapeutic use
  • Antirheumatic Agents / therapeutic use
  • Biological Ontologies*
  • Computational Biology / methods
  • Electronic Health Records*
  • Fingolimod Hydrochloride / therapeutic use
  • Humans
  • Immunosuppressive Agents / therapeutic use
  • Information Storage and Retrieval*
  • Knowledge Discovery
  • Mitoxantrone / therapeutic use
  • Multiple Sclerosis / classification*
  • Multiple Sclerosis / drug therapy
  • PubMed*
  • Rituximab / therapeutic use

Substances

  • Antineoplastic Agents
  • Antirheumatic Agents
  • Immunosuppressive Agents
  • Rituximab
  • Mitoxantrone
  • Fingolimod Hydrochloride

Grants and funding

Funded by the Instituto de Salud Carlos III, Spain (FIS program PS09/00259 and RETICS program RD12/0032/0001), www.isciii.es. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.