Translational Morphosyntax: Distribution of Negation in Clinical Records and Biomedical Journal Articles

Stud Health Technol Inform. 2017:245:346-350.

Abstract

Prior knowledge of the distributional characteristics of linguistic phenomena can be useful for a variety of language processing tasks. This paper describes the distribution of negation in two types of biomedical texts: scientific journal articles and progress notes. Two types of negation are examined: explicit negation at the syntactic level and affixal negation at the sub-word level. The data show that the distribution of negation is significantly different in the two document types, with explicit negation more frequent in the clinical documents than in the scientific publications and affixal negation more frequent in the journal articles at the type level and token levels. All code is available on GitHub <fnr rid="fn001" /><fn id="fn001">https://github.com/KevinBretonnelCohen/NegationDistribution </fn>.

Keywords: Data Mining; Linguistics; Natural Language Processing.

MeSH terms

  • Data Mining
  • Electronic Health Records
  • Humans
  • Language
  • Linguistics*
  • Natural Language Processing*
  • Publishing