Challenges in adapting existing clinical natural language processing systems to multiple, diverse health care settings

J Am Med Inform Assoc. 2017 Sep 1;24(5):986-991. doi: 10.1093/jamia/ocx039.

Abstract

Objective: Widespread application of clinical natural language processing (NLP) systems requires taking existing NLP systems and adapting them to diverse and heterogeneous settings. We describe the challenges faced and lessons learned in adapting an existing NLP system for measuring colonoscopy quality.

Materials and methods: Colonoscopy and pathology reports from 4 settings during 2013-2015, varying by geographic location, practice type, compensation structure, and electronic health record.

Results: Though successful, adaptation required considerably more time and effort than anticipated. Typical NLP challenges in assembling corpora, diverse report structures, and idiosyncratic linguistic content were greatly magnified.

Discussion: Strategies for addressing adaptation challenges include assessing site-specific diversity, setting realistic timelines, leveraging local electronic health record expertise, and undertaking extensive iterative development. More research is needed on how to make it easier to adapt NLP systems to new clinical settings.

Conclusions: A key challenge in widespread application of NLP is adapting existing systems to new clinical settings.

Keywords: cancer screening; data collection; electronic health records; information dissemination; natural language processing.

MeSH terms

  • Colonoscopy*
  • Data Collection
  • Early Detection of Cancer*
  • Electronic Health Records*
  • Humans
  • Information Dissemination
  • Medical Records Systems, Computerized
  • Natural Language Processing*
  • Pathology, Clinical