Matching novel face and voice identity using static and dynamic facial images

Harriet M J Smith; Andrew K Dunn; Thom Baguley; Paula C Stacey

doi:10.3758/s13414-015-1045-8

Atten Percept Psychophys. 2016 Apr;78(3):868-79. doi: 10.3758/s13414-015-1045-8.

Authors

Harriet M J Smith¹ ², Andrew K Dunn³, Thom Baguley³, Paula C Stacey³

Affiliations

¹ Nottingham Trent University, Nottingham, UK.
² Psychology Division, Nottingham Trent University, Burton Street, Nottingham, NG1 4BU, UK.
³ Nottingham Trent University, Nottingham, UK.

Abstract

Research investigating whether faces and voices share common source identity information has offered contradictory results. Accurate face-voice matching is consistently above chance when the facial stimuli are dynamic, but not when the facial stimuli are static. We tested whether procedural differences might help to account for the previous inconsistencies. In Experiment 1, participants completed a sequential two-alternative forced choice matching task. They either heard a voice and then saw two faces or saw a face and then heard two voices. Face-voice matching was above chance when the facial stimuli were dynamic and articulating, but not when they were static. In Experiment 2, we tested whether matching was more accurate when faces and voices were presented simultaneously. The participants saw two face-voice combinations, presented one after the other. They had to decide which combination was the same identity. As in Experiment 1, only dynamic face-voice matching was above chance. In Experiment 3, participants heard a voice and then saw two static faces presented simultaneously. With this procedure, static face-voice matching was above chance. The overall results, analyzed using multilevel modeling, showed that voices and dynamic articulating faces, as well as voices and static faces, share concordant source identity information. It seems, therefore, that above-chance static face-voice matching is sensitive to the experimental procedure employed. In addition, the inconsistencies in previous research might depend on the specific stimulus sets used; our multilevel modeling analyses show that some people look and sound more similar than others.
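As a rough illustration of what "above chance" means in a two-alternative forced choice task, the sketch below computes a one-sided exact binomial p-value against the 50% guessing rate. This is not the analysis reported in the abstract, which used multilevel modeling; the function name `binomial_p_above_chance` and the trial counts are hypothetical and chosen only for the example.

```python
from math import comb


def binomial_p_above_chance(correct: int, trials: int, chance: float = 0.5) -> float:
    """One-sided exact binomial p-value: P(X >= correct) if every response is a guess.

    With two alternatives per trial, the guessing rate (chance) is 0.5.
    """
    if not 0 <= correct <= trials:
        raise ValueError("correct must lie between 0 and trials")
    # Sum the binomial probabilities of scoring `correct` or better by guessing alone.
    return sum(
        comb(trials, k) * chance**k * (1 - chance) ** (trials - k)
        for k in range(correct, trials + 1)
    )


# Hypothetical counts, not data from the study: 60 correct matches in 100 trials.
p_value = binomial_p_above_chance(correct=60, trials=100)
print(f"one-sided p = {p_value:.4f}")
```

A pooled test like this treats every trial as independent and ignores variation between participants and between stimuli. Because the abstract notes that some people look and sound more similar than others, a multilevel model of the kind the authors used may be the more appropriate analysis for real data.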

Keywords: Crossmodal matching; Dynamic; Face; Static; Voice.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Acoustic Stimulation
Adolescent
Adult
Aged
Auditory Perception
Choice Behavior*
Face*
Facial Recognition*
Female
Humans
Male
Middle Aged
Photic Stimulation
Voice*
Young Adult