A tale of two explanations: Enhancing human trust by explaining robot behavior

Mark Edmonds; Feng Gao; Hangxin Liu; Xu Xie; Siyuan Qi; Brandon Rothrock; Yixin Zhu; Ying Nian Wu; Hongjing Lu; Song-Chun Zhu

doi:10.1126/scirobotics.aay4663

A tale of two explanations: Enhancing human trust by explaining robot behavior

Sci Robot. 2019 Dec 18;4(37):eaay4663. doi: 10.1126/scirobotics.aay4663.

Authors

Mark Edmonds¹, Feng Gao², Hangxin Liu³, Xu Xie², Siyuan Qi³, Brandon Rothrock⁴, Yixin Zhu⁵, Ying Nian Wu², Hongjing Lu^{2

6}, Song-Chun Zhu^{1

2}

Affiliations

¹ Department of Computer Science, UCLA, Los Angeles, CA 90095, USA. [email protected] [email protected] [email protected].
² Department of Statistics, UCLA, Los Angeles, CA 90095, USA.
³ Department of Computer Science, UCLA, Los Angeles, CA 90095, USA.
⁴ Jet Propulsion Laboratory, Caltech, Los Angeles, CA 91109, USA.
⁵ Department of Statistics, UCLA, Los Angeles, CA 90095, USA. [email protected] [email protected] [email protected].
⁶ Department of Psychology, UCLA, Los Angeles, CA 90095, USA.

PMID: 33137717
DOI: 10.1126/scirobotics.aay4663

Abstract

The ability to provide comprehensive explanations of chosen actions is a hallmark of intelligence. Lack of this ability impedes the general acceptance of AI and robot systems in critical tasks. This paper examines what forms of explanations best foster human trust in machines and proposes a framework in which explanations are generated from both functional and mechanistic perspectives. The robot system learns from human demonstrations to open medicine bottles using (i) an embodied haptic prediction model to extract knowledge from sensory feedback, (ii) a stochastic grammar model induced to capture the compositional structure of a multistep task, and (iii) an improved Earley parsing algorithm to jointly leverage both the haptic and grammar models. The robot system not only shows the ability to learn from human demonstrators but also succeeds in opening new, unseen bottles. Using different forms of explanations generated by the robot system, we conducted a psychological experiment to examine what forms of explanations best foster human trust in the robot. We found that comprehensive and real-time visualizations of the robot's internal decisions were more effective in promoting human trust than explanations based on summary text descriptions. In addition, forms of explanation that are best suited to foster trust do not necessarily correspond to the model components contributing to the best task performance. This divergence shows a need for the robotics community to integrate model components to enhance both task execution and human trust in machines.