Analyzing differences between Chinese and English clinical text: a cross-institution comparison of discharge summaries in two languages

Stud Health Technol Inform. 2013;192:662-6.

Abstract

Worldwide adoption of Electronic Medical Record (EMR) databases in health care has generated an unprecedented amount of clinical data available electronically. There has been an increasing trend among US and other Western institutions towards collaborating with China on medical research using EMR data. However, few studies have investigated the characteristics of EMR data in China and how they differ from data in US hospitals. As an initial step towards differentiating EMR data in Chinese and US systems, this study attempts to understand the system and cultural differences that may exist between Chinese and English clinical documents. We collected inpatient discharge summaries from one Chinese institution and three US institutions and manually analyzed three major clinical components in the text: medical problems, tests, and treatments. We reported comparison results at the document level and the section level and discussed potential reasons for the observed differences. Documenting and understanding differences between clinical reports in US and Chinese EMRs is important for cross-country collaborations. Our study also provided valuable insights for developing natural language processing tools for Chinese clinical text.

Publication types

  • Comparative Study
  • Multicenter Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • China
  • Documentation*
  • Electronic Health Records / classification*
  • Medical Record Linkage / methods*
  • Natural Language Processing*
  • Patient Discharge Summaries / classification*
  • Semantics
  • Translating*
  • United States
  • Vocabulary, Controlled*