Cohort Discovery Query Optimization via Computable Controlled Vocabulary Versioning

Stud Health Technol Inform. 2015:216:1084.

Abstract

Self-service cohort discovery tools strive to provide intuitive interfaces to large Clinical Data Warehouses that contain extensive historic information. In those tools, controlled vocabulary (e.g., ICD-9-CM, CPT) coded clinical information is often the main search criteria used because of its ubiquity in billing processes. These tools generally require a researcher to pick specific terms from the controlled vocabulary. However, controlled vocabularies evolve over time as medical knowledge changes and can even be replaced with new versions (e.g., ICD-9 to ICD-10). These tools generally only display the current version of the controlled vocabulary. Researchers should not be expected to understand the underlying controlled vocabulary versioning issues. We propose a computable controlled vocabulary versioning system that allows cohort discovery tools to automatically expand queries to account for terminology changes.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Data Accuracy*
  • Data Mining / methods*
  • Database Management Systems / organization & administration*
  • Electronic Health Records / organization & administration*
  • International Classification of Diseases*
  • Natural Language Processing*