Case Repositories: Towards Case-Based Reasoning for AI Alignment

Feng, K. J. Kevin; Chen, Quan Ze; Cheong, Inyoung; Xia, King; Zhang, Amy X.

Computer Science > Artificial Intelligence

arXiv:2311.10934 (cs)

[Submitted on 18 Nov 2023 (v1), last revised 26 Nov 2023 (this version, v3)]

Title:Case Repositories: Towards Case-Based Reasoning for AI Alignment

Authors:K. J. Kevin Feng, Quan Ze Chen, Inyoung Cheong, King Xia, Amy X. Zhang

View PDF

Abstract:Case studies commonly form the pedagogical backbone in law, ethics, and many other domains that face complex and ambiguous societal questions informed by human values. Similar complexities and ambiguities arise when we consider how AI should be aligned in practice: when faced with vast quantities of diverse (and sometimes conflicting) values from different individuals and communities, with whose values is AI to align, and how should AI do so? We propose a complementary approach to constitutional AI alignment, grounded in ideas from case-based reasoning (CBR), that focuses on the construction of policies through judgments on a set of cases. We present a process to assemble such a case repository by: 1) gathering a set of ``seed'' cases -- questions one may ask an AI system -- in a particular domain, 2) eliciting domain-specific key dimensions for cases through workshops with domain experts, 3) using LLMs to generate variations of cases not seen in the wild, and 4) engaging with the public to judge and improve cases. We then discuss how such a case repository could assist in AI alignment, both through directly acting as precedents to ground acceptable behaviors, and as a medium for individuals and communities to engage in moral reasoning around AI.

Comments:	MP2 workshop @ NeurIPS 2023
Subjects:	Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2311.10934 [cs.AI]
	(or arXiv:2311.10934v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2311.10934

Submission history

From: K. J. Kevin Feng [view email]
[v1] Sat, 18 Nov 2023 02:02:40 UTC (1,092 KB)
[v2] Tue, 21 Nov 2023 06:25:19 UTC (1,092 KB)
[v3] Sun, 26 Nov 2023 21:07:10 UTC (1,092 KB)

Computer Science > Artificial Intelligence

Title:Case Repositories: Towards Case-Based Reasoning for AI Alignment

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Case Repositories: Towards Case-Based Reasoning for AI Alignment

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators