Coreference resolution: a review of general methodologies and applications in the clinical domain

J Biomed Inform. 2011 Dec;44(6):1113-22. doi: 10.1016/j.jbi.2011.08.006. Epub 2011 Aug 12.

Abstract

Coreference resolution is the task of determining which linguistic expressions refer to the same real-world entity in natural language. Research on coreference resolution in the general English domain dates back to the 1960s and 1970s. However, research on coreference resolution in clinical free text has not seen major development. Recent US government initiatives that promote the use of electronic health records (EHRs) provide opportunities to mine patient notes as more and more health care institutions adopt EHRs. Our goal was to review recent advances in general-purpose coreference resolution to lay the foundation for methodologies in the clinical domain, facilitated by the availability of a shared lexical resource of gold-standard coreference annotations, the Ontology Development and Information Extraction (ODIE) corpus.

Publication types

  • Research Support, N.I.H., Extramural
  • Review

MeSH terms

  • Electronic Health Records
  • Humans
  • Information Storage and Retrieval
  • Linguistics
  • Medical Informatics / methods*
  • Natural Language Processing*