Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jul 14:8:1180.
doi: 10.3389/fpsyg.2017.01180. eCollection 2017.

Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

Affiliations

Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

Maureen Fontaine et al. Front Psychol. .

Abstract

The ability to recognize an individual from their voice is a widespread ability with a long evolutionary history. Yet, the perceptual representation of familiar voices is ill-defined. In two experiments, we explored the neuropsychological processes involved in the perception of voice identity. We specifically explored the hypothesis that familiar voices (trained-to-familiar (Experiment 1), and famous voices (Experiment 2)) are represented as a whole complex pattern, well approximated by the average of multiple utterances produced by a single speaker. In experiment 1, participants learned three voices over several sessions, and performed a three-alternative forced-choice identification task on original voice samples and several "speaker averages," created by morphing across varying numbers of different vowels (e.g., [a] and [i]) produced by the same speaker. In experiment 2, the same participants performed the same task on voice samples produced by familiar speakers. The two experiments showed that for famous voices, but not for trained-to-familiar voices, identification performance increased and response times decreased as a function of the number of utterances in the averages. This study sheds light on the perceptual representation of familiar voices, and demonstrates the power of average in recognizing familiar voices. The speaker average captures the unique characteristics of a speaker, and thus retains the information essential for recognition; it acts as a prototype of the speaker.

Keywords: average; familiarity; identity; prototypes; recognition; speech; voice; vowels.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Performance in the recognition of trained-to-familiar voices. Percent correct (A) and response times (B; ms) are represented as a function of the level of averageness (i.e., number of utterances per voice average). Gray dots represent each participant's data point. The black square represents the average performance across listeners. In (A), the dotted line indicates chance level. Black lines: linear regression built using the average slope and intercept values obtained after performing the linear regression in each subject. The slope was significantly decreasing in (A) indicating that performance worsened with increasing number of utterances per average.
Figure 2
Figure 2
Performance in the recognition of famous voices. Percent correct (A) and response times (B;ms) are represented as a function of the level of averageness (i.e., number of utterances per voice average). Gray dots represent each participant's data point. The black square represents the average performance across listeners. In (A), the dotted line indicates chance level. Black lines: linear regression built using the average slope and intercept values obtained after performing the linear regression in each and every subject. For famous voices, performance increased and RTs decreased significantly with increasing number of utterances per average.

References

    1. Andics A., Mcqueen J. M., Petersson K. M., Gal V., Rudas G., Vidnyanszky Z. (2010). Neural mechanisms for voice recognition. Neuroimage 52, 1528–1540. 10.1016/j.neuroimage.2010.05.048 - DOI - PubMed
    1. Andics A., Mcqueen J. M., Van Turennout M. (2007). Phonetic content influences voice discriminability, in Proceedings of the 16th International Congress of Phonetic Sciences (ICPhS 2007), eds Trouvain J., Barry W. J. (Dudweiler: Pirrot; ), 1829–1832.
    1. Baumann O., Belin P. (2010). Perceptual scaling of voice identity: common dimensions for different vowels and speakers. Psychol. Res. 74, 110–120. 10.1007/s00426-008-0185-z - DOI - PubMed
    1. Belin P., Bestelmeyer P. E. G., Latinus M., Watson R. (2011). Understanding voice perception. Br. J. psychol. 102, 711–725. 10.1111/j.2044-8295.2011.02041.x - DOI - PubMed
    1. Blank H., Wieland N., Von Kriegstein K. (2014). Person recognition and the brain: merging evidence from patients and healthy individuals. Neurosci. Biobehav. Rev. 47, 717–734. 10.1016/j.neubiorev.2014.10.022 - DOI - PubMed

LinkOut - more resources