ChatGPT's advice is perceived as better than that of professional advice columnists

Piers Douglas Lionel Howe; Nicolas Fay; Morgan Saletta; Eduard Hovy

doi:10.3389/fpsyg.2023.1281255

ChatGPT's advice is perceived as better than that of professional advice columnists

Front Psychol. 2023 Nov 21:14:1281255. doi: 10.3389/fpsyg.2023.1281255. eCollection 2023.

Authors

Piers Douglas Lionel Howe¹, Nicolas Fay², Morgan Saletta³, Eduard Hovy⁴

Affiliations

¹ Complex Human Data Hub, Melbourne School of Psychological Sciences, University of Melbourne, Melbourne, VIC, Australia.
² School of Psychological Science, University of Western Australia, Perth, WA, Australia.
³ Hunt Laboratory, University of Melbourne, Melbourne, VIC, Australia.
⁴ Melbourne Connect, University of Melbourne, Melbourne, VIC, Australia.

Abstract

ChatGPT is a high-performance large language model that has the potential to significantly improve human-computer interactions. It can provide advice on a range of topics, but it is unclear how good this advice is relative to that provided by competent humans, especially in situations where empathy is required. Here, we report the first investigation of whether ChatGPT's responses are perceived as better than those of humans in a task where humans were attempting to be empathetic. Fifty social dilemma questions were randomly selected from 10 well-known advice columns. In a pre-registered survey, participants (N = 404) were each shown one question, along with the corresponding response by an advice columnist and by ChatGPT. ChatGPT's advice was perceived as more balanced, complete, empathetic, helpful, and better than the advice provided by professional advice columnists (all values of p < 0.001). Although participants could not determine which response was written by ChatGPT (54%, p = 0.29), most participants preferred that their own social dilemma questions be answered by a human than by a computer (77%, p < 0.001). ChatGPT's responses were longer than those produced by the advice columnists (mean 280.9 words vs. 142.2 words, p < 0.001). In a second pre-registered survey, each ChatGPT answer was constrained to be approximately the same length as that of the advice columnist (mean 143.2 vs. 142.2 words, p = 0.95). This survey (N = 401) replicated the above findings, showing that the benefit of ChatGPT was not solely due to it writing longer answers.

Keywords: ChatGPT; advice; advice column; agony aunt; empathy.

Grants and funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. The research was supported by an Office of National Intelligence (ONI) and Australian Research Council (ARC) grant (NI210100224), and the Western Australian Government (Defense Science Center).