Unimodal and cross-modal identity judgements using an audio-visual sorting task: Evidence for independent processing of faces and voices

Nadine Lavan; Harriet M J Smith; Carolyn McGettigan

doi:10.3758/s13421-021-01198-7

Mem Cognit. 2022 Jan;50(1):216-231. doi: 10.3758/s13421-021-01198-7. Epub 2021 Jul 12.

Authors

Nadine Lavan^#¹,², Harriet M J Smith^#³, Carolyn McGettigan⁴

Affiliations

¹ Department of Speech, Hearing and Phonetic Sciences, University College London, London, UK.
² Department of Biological and Experimental Psychology, School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London, E1 4NS, UK.
³ Department of Psychology, Nottingham Trent University, Nottingham, NG1 4FQ, UK.
⁴ Department of Speech, Hearing and Phonetic Sciences, University College London, London, UK.

^# Contributed equally.

Abstract

Unimodal and cross-modal information provided by faces and voices contributes to identity percepts. To examine how these sources of information interact, we devised a novel audio-visual sorting task in which participants were required to group video-only and audio-only clips into two identities. In a series of three experiments, we show that unimodal face and voice sorting were more accurate than cross-modal sorting: Face sorting was consistently the most accurate, followed by voice sorting, whereas cross-modal sorting was at chance level or below. In Experiment 1, we compared performance in our novel audio-visual sorting task to a traditional identity matching task, showing that unimodal and cross-modal identity perception were overall moderately more accurate in the sorting task than in the traditional identity matching task. In Experiment 2, separating unimodal from cross-modal sorting led to small improvements in accuracy for unimodal sorting, but no change in cross-modal sorting performance. In Experiment 3, we explored the effect of minimal audio-visual training: Participants were shown a clip of the two identities in conversation prior to completing the sorting task. This led to small, nonsignificant improvements in accuracy for unimodal and cross-modal sorting. Our results indicate that unfamiliar face and voice perception operate relatively independently with no evidence of mutual benefit, suggesting that extracting reliable cross-modal identity information is challenging.

Keywords: Cross-modal; Face; Identity perception; Sorting; Unimodal; Voice.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Auditory Perception*
Humans
Recognition, Psychology*
Voice*
