Evaluation of the Accuracy of ChatGPT in Answering Clinical Questions on the Japanese Society of Hypertension Guidelines

Kenya Kusunose; Shuichiro Kashima; Masataka Sata

doi:10.1253/circj.CJ-23-0308

Evaluation of the Accuracy of ChatGPT in Answering Clinical Questions on the Japanese Society of Hypertension Guidelines

Circ J. 2023 Jun 23;87(7):1030-1033. doi: 10.1253/circj.CJ-23-0308. Epub 2023 Jun 7.

Authors

Kenya Kusunose^{1

2}, Shuichiro Kashima¹, Masataka Sata¹

Affiliations

¹ Department of Cardiovascular Medicine, Tokushima University Hospital.
² Department of Cardiovascular Medicine, Nephrology, and Neurology, Graduate School of Medicine, University of the Ryukyus.

PMID: 37286486
DOI: 10.1253/circj.CJ-23-0308

Abstract

Background: To assist healthcare providers in interpreting guidelines, clinical questions (CQ) are often included, but not always, which can make interpretation difficult for non-expert clinicians. We evaluated the ability of ChatGPT to accurately answer CQs on the Japanese Society of Hypertension Guidelines for the Management of Hypertension (JSH 2019).

Methods and results: We conducted an observational study using data from JSH 2019. The accuracy rate for CQs and limited evidence-based questions of the guidelines (Qs) were evaluated. ChatGPT demonstrated a higher accuracy rate for CQs than for Qs (80% vs. 36%, P value: 0.005).

Conclusions: ChatGPT has the potential to be a valuable tool for clinicians in the management of hypertension.

Keywords: ChatGPT; Guidelines; Hypertension; Large language models.

Publication types

Observational Study

MeSH terms

Artificial Intelligence
East Asian People*
Health Personnel
Humans
Hypertension* / diagnosis
Hypertension* / drug therapy
Reproducibility of Results
Social Media