Improving heart failure information extraction by domain adaptation

Youngjun Kim; Jennifer Garvin; Julia Heavirland; Stéphane M Meystre

Stud Health Technol Inform. 2013;192:185-9.

Authors

Youngjun Kim¹, Jennifer Garvin, Julia Heavirland, Stéphane M Meystre

Affiliation

¹ School of Computing, University of Utah, Salt Lake City, Utah, U.S.

PMID: 23920541

Abstract

Adapting an information extraction application to a new domain (e.g., new categories of narrative text) typically requires re-training the application with the new narratives. But could prior training on the original domain ease this adaptation? After developing an NLP-based application to extract congestive heart failure treatment performance measures from echocardiogram reports (i.e., the source domain), we adapted it to a large variety of clinical documents (i.e., the target domain). We wanted to reuse the trained machine learning models from the source domain, and experimented with several popular domain adaptation approaches, such as reusing the predictions from the source model or applying a linear interpolation. As a result, we measured higher recall and precision (92.4% and 95.3%, respectively) than when training on the target domain only.
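
The two adaptation approaches named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the models, features, or interpolation weight, so the label names (`MENTION`, `O`), the weight `lam`, the feature names, and both helper functions are hypothetical and chosen only for illustration. The sketch assumes each model outputs a probability per label for a given token.

```python
from typing import Dict


def interpolate(p_source: Dict[str, float],
                p_target: Dict[str, float],
                lam: float = 0.5) -> Dict[str, float]:
    """Linear interpolation of two label distributions.

    Computes lam * p_target + (1 - lam) * p_source for each label.
    `lam` is an assumed tuning weight, not a value from the paper.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must be in [0, 1]")
    labels = set(p_source) | set(p_target)
    return {
        label: lam * p_target.get(label, 0.0)
        + (1.0 - lam) * p_source.get(label, 0.0)
        for label in labels
    }


def add_source_prediction_feature(features: Dict[str, float],
                                  source_label: str) -> Dict[str, float]:
    """Reuse the source model's prediction as an extra target-model feature.

    Returns a copy of `features` with one added indicator feature.
    """
    augmented = dict(features)
    augmented[f"source_pred={source_label}"] = 1.0
    return augmented


# Hypothetical label probabilities for one token from each model.
p_src = {"MENTION": 0.70, "O": 0.30}
p_tgt = {"MENTION": 0.40, "O": 0.60}

mixed = interpolate(p_src, p_tgt, lam=0.5)
best_label = max(mixed, key=mixed.get)

# Hypothetical target-domain features for the same token.
token_features = {"word=ef": 1.0, "is_number_next": 1.0}
augmented = add_source_prediction_feature(
    token_features, max(p_src, key=p_src.get)
)
```

In the first approach the source model still votes directly on the final label, with `lam` controlling how much the target model is trusted. In the second, the target model learns for itself how much weight to give the source prediction, since that prediction is just one more feature.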

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Artificial Intelligence*
Heart Failure / diagnosis*
Humans
Medical Record Linkage / methods*
Medical Records Systems, Computerized*
Natural Language Processing*
Pattern Recognition, Automated / methods
Semantics
Systems Integration
Terminology as Topic*
Utah
Vocabulary, Controlled*