Ontologies in Big Health Data Analytics: Application to Routine Clinical Data

Stud Health Technol Inform. 2018:255:65-69.

Abstract

Ontologies are an important big-data analytics tool. Historically code lists were created by domain experts and mapped between different coding systems. Ontologies allow us to develop better representations of clinical concepts, data and facilitate better data extracts from routine clinical data. It also makes the process of case identification and key outcome measures transparent. We describe a process we have operationalised in our research. We use ontologies to resolve the semantics of complex health care data. The use of the method is demonstrated through a pregnancy case identification method. Pregnancy data are recorded in different coding systems and stored in different general practice systems; and pregnancy has its own complexities in that not all pregnancies proceed to term, they have different lengths and involve multiple providers of health care.

Keywords: Biomedical Ontologies; Controlled Vocabulary; Information Storage and Retrieval; Medical Record systems; computerized.

MeSH terms

  • Big Data*
  • Data Science
  • Health* / statistics & numerical data
  • Information Storage and Retrieval*
  • Semantics
  • Vocabulary, Controlled*