Evaluating and Modeling Social Intelligence: A Comparative Study of Human and AI Capabilities

Wang, Junqi; Zhang, Chunhui; Li, Jiapeng; Ma, Yuxi; Niu, Lixing; Han, Jiaheng; Peng, Yujia; Zhu, Yixin; Fan, Lifeng

arXiv:2405.11841 (cs)

[Submitted on 20 May 2024]

Abstract: Amid the ongoing debate over whether Large Language Models (LLMs) attain near-human intelligence (Mitchell & Krakauer, 2023; Bubeck et al., 2023; Kosinski, 2023; Shiffrin & Mitchell, 2023; Ullman, 2023), this study introduces a benchmark for evaluating social intelligence, one of the most distinctive aspects of human cognition. We developed a comprehensive theoretical framework for social dynamics and introduced two evaluation tasks: Inverse Reasoning (IR) and Inverse Inverse Planning (IIP). Our approach also included a computational model based on recursive Bayesian inference that accounts for diverse human behavioral patterns. Extensive experiments and detailed analyses revealed that humans surpassed the latest GPT models in overall performance, zero-shot learning, one-shot generalization, and adaptability to multiple modalities. Notably, GPT models demonstrated social intelligence only at the most basic order (order = 0), in stark contrast to human social intelligence (order >= 2). Further examination indicated that LLMs tend to rely on pattern recognition as a shortcut, casting doubt on whether they possess authentic human-level social intelligence. Our code, dataset, appendix, and human data are released at this https URL.
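
To illustrate what "order" can mean in a recursive Bayesian setting, the following is a minimal sketch and not the authors' model. It assumes a toy world with two goals and three actions; the names `GOALS`, `ACTIONS`, `UTILITY`, `PRIOR`, `BETA`, `actor`, and `observer` are all hypothetical. An order-0 actor simply picks useful actions, an observer inverts the actor with Bayes' rule, and a higher-order actor prefers actions that a lower-order observer would read correctly.

```python
import math

# Hypothetical toy world (an assumption, not taken from the paper).
GOALS = ["A", "B"]
ACTIONS = ["left", "middle", "right"]
# Assumed utilities: how well each action serves each goal.
UTILITY = {
    "A": {"left": 1.0, "middle": 0.8, "right": 0.0},
    "B": {"left": 0.0, "middle": 0.8, "right": 1.0},
}
PRIOR = {"A": 0.5, "B": 0.5}  # uniform prior over goals
BETA = 3.0                    # assumed rationality (softmax) parameter


def normalize(weights):
    """Scale non-negative weights so they sum to 1."""
    total = sum(weights.values())
    return {k: v / total for k, v in weights.items()}


def actor(order):
    """Return P(action | goal) for an actor at the given recursion order."""
    if order == 0:
        # Order 0: soft-maximize own utility, ignoring any observer.
        return {
            g: normalize({a: math.exp(BETA * UTILITY[g][a]) for a in ACTIONS})
            for g in GOALS
        }
    # Order k > 0: favor actions that an order-(k-1) observer
    # would attribute to the true goal.
    obs = observer(order - 1)
    return {
        g: normalize({a: obs[a][g] ** BETA for a in ACTIONS})
        for g in GOALS
    }


def observer(order):
    """Return P(goal | action) by Bayes' rule against the same-order actor."""
    act = actor(order)
    return {
        a: normalize({g: PRIOR[g] * act[g][a] for g in GOALS})
        for a in ACTIONS
    }


if __name__ == "__main__":
    for k in range(3):
        print(k, actor(k)["A"])
```

In this sketch the ambiguous action `middle` tells an order-0 observer nothing about the goal, so an order-1 actor shifts probability away from it toward the more legible `left` or `right`.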

Comments:	Also published in Proceedings of the Annual Meeting of the Cognitive Science Society (CogSci), 2024
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.11841 [cs.AI]
	(or arXiv:2405.11841v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2405.11841

Submission history

From: Lifeng Fan
[v1] Mon, 20 May 2024 07:34:48 UTC (7,479 KB)
