Biotea: semantics for Pubmed Central

Alexander Garcia; Federico Lopez; Leyla Garcia; Olga Giraldo; Victor Bucheli; Michel Dumontier

doi:10.7717/peerj.4201

Biotea: semantics for Pubmed Central

PeerJ. 2018 Jan 2:6:e4201. doi: 10.7717/peerj.4201. eCollection 2018.

Authors

Alexander Garcia¹, Federico Lopez², Leyla Garcia³, Olga Giraldo¹, Victor Bucheli², Michel Dumontier⁴

Affiliations

¹ Ontology Engineering Group, Universidad Politécnica de Madrid, Madrid, Spain.
² Escuela de Ingeniería de Sistemas y Computación, Universidad del Valle, Cali, Colombia.
³ Temporal Knowledge Bases Group, Department of Computer Languages and Systems, Universitat Jaume I, Castelló de la Plana, Spain.
⁴ Maastricht University, Institute of Data Science, Maastricht, The Netherlands.

Abstract

A significant portion of biomedical literature is represented in a manner that makes it difficult for consumers to find or aggregate content through a computational query. One approach to facilitate reuse of the scientific literature is to structure this information as linked data using standardized web technologies. In this paper we present the second version of Biotea, a semantic, linked data version of the open-access subset of PubMed Central that has been enhanced with specialized annotation pipelines that uses existing infrastructure from the National Center for Biomedical Ontology. We expose our models, services, software and datasets. Our infrastructure enables manual and semi-automatic annotation, resulting data are represented as RDF-based linked data and can be readily queried using the SPARQL query language. We illustrate the utility of our system with several use cases. Our datasets, methods and techniques are available at http://biotea.github.io.

Keywords: Linked data; Ontology; RDF; SPARQL; Semantic; Semantic web.

Grants and funding

Olga Ximena Giraldo has been funded by the EU project DrInventor FP7-ICT-2013.8.1. Alexander Garcia has been funded by the KOPAR project, H2020-MSCA-IF-2014, Grant Agreement nr: 655009. Federico Lopez has been funded by linkingdata.io. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.