Evaluation of the Accuracy of ChatGPT in Answering Clinical Questions on the Japanese Society of Hypertension Guidelines

Circ J. 2023 Jun 23;87(7):1030-1033. doi: 10.1253/circj.CJ-23-0308. Epub 2023 Jun 7.

Abstract

Background: Guidelines often, but not always, include clinical questions (CQs) to help healthcare providers interpret them; where CQs are absent, interpretation can be difficult for non-expert clinicians. We evaluated the ability of ChatGPT to accurately answer CQs on the Japanese Society of Hypertension Guidelines for the Management of Hypertension (JSH 2019).

Methods and results: We conducted an observational study using data from JSH 2019. The accuracy rates for CQs and for limited evidence-based questions of the guidelines (Qs) were evaluated. ChatGPT demonstrated a higher accuracy rate for CQs than for Qs (80% vs. 36%, P=0.005).
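
The abstract reports neither the underlying counts nor the statistical test used. As a minimal sketch of how two accuracy rates from small samples could be compared, the following assumes a two-sided Fisher's exact test on a 2x2 table of correct and incorrect answers. The function name and the counts are hypothetical, chosen only to give percentages close to those reported; they are not data from the study.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Rows are question groups; columns are correct / incorrect answers.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x correct answers in the first group,
        # with all row and column totals held fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    # Sum the probabilities of every table at least as unlikely as the
    # observed one (small tolerance guards against rounding in ties).
    return sum(
        prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9)
    )


# Hypothetical counts for illustration only; NOT taken from the paper.
cq_correct, cq_total = 16, 20
q_correct, q_total = 4, 11

p = fisher_exact_two_sided(
    cq_correct, cq_total - cq_correct, q_correct, q_total - q_correct
)
print(f"CQ accuracy: {cq_correct / cq_total:.0%}")
print(f"Q accuracy:  {q_correct / q_total:.0%}")
print(f"two-sided p: {p:.3f}")
```

An exact test is a common choice when group sizes are small, as they would be for a single guideline's question set; whether the authors used this test is an assumption.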

Conclusions: ChatGPT has the potential to be a valuable tool for clinicians in the management of hypertension.

Keywords: ChatGPT; Guidelines; Hypertension; Large language models.

Publication types

  • Observational Study

MeSH terms

  • Artificial Intelligence
  • East Asian People*
  • Health Personnel
  • Humans
  • Hypertension* / diagnosis
  • Hypertension* / drug therapy
  • Reproducibility of Results
  • Social Media