Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

Maureen Fontaine¹, Scott A Love¹, Marianne Latinus¹

Affiliations

PMID: 28769836
PMCID: PMC5509798
DOI: 10.3389/fpsyg.2017.01180

Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

Maureen Fontaine et al. Front Psychol. 2017.

. 2017 Jul 14:8:1180.

doi: 10.3389/fpsyg.2017.01180. eCollection 2017.

Authors

Maureen Fontaine¹, Scott A Love¹, Marianne Latinus¹

Affiliation

¹ UMR7289, Centre National de la Recherche Scientifique, Institut de Neuroscience de la Timone, Aix-Marseille UniversitéMarseille, France.

PMID: 28769836
PMCID: PMC5509798
DOI: 10.3389/fpsyg.2017.01180

Abstract

The ability to recognize an individual from their voice is a widespread ability with a long evolutionary history. Yet, the perceptual representation of familiar voices is ill-defined. In two experiments, we explored the neuropsychological processes involved in the perception of voice identity. We specifically explored the hypothesis that familiar voices (trained-to-familiar (Experiment 1), and famous voices (Experiment 2)) are represented as a whole complex pattern, well approximated by the average of multiple utterances produced by a single speaker. In experiment 1, participants learned three voices over several sessions, and performed a three-alternative forced-choice identification task on original voice samples and several "speaker averages," created by morphing across varying numbers of different vowels (e.g., [a] and [i]) produced by the same speaker. In experiment 2, the same participants performed the same task on voice samples produced by familiar speakers. The two experiments showed that for famous voices, but not for trained-to-familiar voices, identification performance increased and response times decreased as a function of the number of utterances in the averages. This study sheds light on the perceptual representation of familiar voices, and demonstrates the power of average in recognizing familiar voices. The speaker average captures the unique characteristics of a speaker, and thus retains the information essential for recognition; it acts as a prototype of the speaker.

Keywords: average; familiarity; identity; prototypes; recognition; speech; voice; vowels.

PubMed Disclaimer

Figures

**Figure 1**
Performance in the recognition of trained-to-familiar voices. Percent correct **(A)** and response times (B; ms) are represented as a function of the level of averageness (i.e., number of utterances per voice average). Gray dots represent each participant's data point. The black square represents the average performance across listeners. In **(A)**, the dotted line indicates chance level. Black lines: linear regression built using the average slope and intercept values obtained after performing the linear regression in each subject. The slope was significantly decreasing in **(A)** indicating that performance worsened with increasing number of utterances per average.

**Figure 2**
Performance in the recognition of famous voices. Percent correct **(A)** and response times (B;ms) are represented as a function of the level of averageness (i.e., number of utterances per voice average). Gray dots represent each participant's data point. The black square represents the average performance across listeners. In **(A)**, the dotted line indicates chance level. Black lines: linear regression built using the average slope and intercept values obtained after performing the linear regression in each and every subject. For famous voices, performance increased and RTs decreased significantly with increasing number of utterances per average.

See this image and copyright information in PMC

References

1. Andics A., Mcqueen J. M., Petersson K. M., Gal V., Rudas G., Vidnyanszky Z. (2010). Neural mechanisms for voice recognition. Neuroimage 52, 1528–1540. 10.1016/j.neuroimage.2010.05.048 - DOI - PubMed
1. Andics A., Mcqueen J. M., Van Turennout M. (2007). Phonetic content influences voice discriminability, in Proceedings of the 16th International Congress of Phonetic Sciences (ICPhS 2007), eds Trouvain J., Barry W. J. (Dudweiler: Pirrot; ), 1829–1832.
1. Baumann O., Belin P. (2010). Perceptual scaling of voice identity: common dimensions for different vowels and speakers. Psychol. Res. 74, 110–120. 10.1007/s00426-008-0185-z - DOI - PubMed
1. Belin P., Bestelmeyer P. E. G., Latinus M., Watson R. (2011). Understanding voice perception. Br. J. psychol. 102, 711–725. 10.1111/j.2044-8295.2011.02041.x - DOI - PubMed
1. Blank H., Wieland N., Von Kriegstein K. (2014). Person recognition and the brain: merging evidence from patients and healthy individuals. Neurosci. Biobehav. Rev. 47, 717–734. 10.1016/j.neubiorev.2014.10.022 - DOI - PubMed

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

Affiliation

Familiarity and Voice Representation: From Acoustic-Based Representation to Voice Averages

Authors

Affiliation

Abstract

Figures

References

LinkOut - more resources

Full Text Sources

Other Literature Sources