Active Learning Pipeline to Identify Candidate Terms for a CDSS Ontology

Stud Health Technol Inform. 2024 Aug 22:316:1338-1342. doi: 10.3233/SHTI240660.

Abstract

Ontology is essential for achieving health information and information technology application interoperability in the biomedical fields and beyond. Traditionally, ontology construction is carried out manually by human domain experts (HDE). Here, we explore an active learning approach to automatically identify candidate terms from publications, with manual verification later as a part of a deep learning model training and learning process. We introduce the overall architecture of the active learning pipeline and present some preliminary results. This work is a critical and complementary component in addition to manually building the ontology, especially during the long-term maintenance stage.

Keywords: Clinical decision support system ontology; active learning; automatic keyphrase identification; deep learning; natural language processing.

MeSH terms

  • Biological Ontologies*
  • Humans
  • Problem-Based Learning
  • Supervised Machine Learning
  • Terminology as Topic
  • Vocabulary, Controlled