Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Sep-Oct;104(5):922-925.
doi: 10.3813/AAA.919244.

Revisiting Models of Concurrent Vowel Identification: The Critical Case of No Pitch Differences

Affiliations

Revisiting Models of Concurrent Vowel Identification: The Critical Case of No Pitch Differences

Samuel S Smith et al. Acta Acust United Acust. 2018 Sep-Oct.

Abstract

When presented with two vowels simultaneously, humans are often able to identify the constituent vowels. Computational models exist that simulate this ability, however they predict listener confusions poorly, particularly in the case where the two vowels have the same fundamental frequency. Presented here is a model that is uniquely able to predict the combined representation of concurrent vowels. The given model is able to predict listener's systematic perceptual decisions to a high degree of accuracy.

Keywords: 43.66.-x; 43.66Ba; 43.71.-k; 43.71.Es; 43.72.-p; 43.72.Qr; PACS no. 4371.An.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
A diagram describing our model of CVI.
Figure 2.
Figure 2.
A scatter plot comparing the probabilities with which humans predicted concurrent vowel pairs had been presented, against probabilities predicted from the combined-class (ο; σ2 = 1.03) and individual-class (×; σ2 = 1.20) variants of our spectral model. The probabilities of confusing /ɝ,ɝ/ for /u.ɝ/, and correctly identifying /a,æ/, are indicated.
Figure 3.
Figure 3.
Correlation coefficients (r) between predicted confusions for model variants, and listener confusions. ‘Sp’: Spectral pathway, ‘Te’: Temporal pathway. [11],[12],[13] are references to different cochlea filter-shapes. a) Individual classes, b) Combined classes, c) Combined classes with non-linear cochlear model [13].
Figure 4.
Figure 4.
(Colour online) Average number of concurrent vowels correctly predicted as a function of internal noise, for variants of our model. [11],[ 12],[ 13] reference different cochlea filter-shapes.

References

    1. Scheffers MTM: Sifting vowels: Auditory pitch analysis and sound segregation. Ph.D. thesis, University of Groningen; 1983.
    1. Micheyl C and Oxenham AJ: Pitch, harmonicity and concurrent sound segregation: Psychoacoustical and neurophysiological findings. Hear Res. 266 (2010) 36–51. - PMC - PubMed
    1. Assmann PF and Summerfield Q: Modeling the perception of concurrent vowels: Vowels with the same fundamental frequency. J Acoust Soc Am. 85 (1989) 327–338. - PubMed
    1. Assmann PF and Summerfield Q: Modeling the perception of concurrent vowels: vowels with different fundamental frequencies. J Acoust Soc Am. 88 (1990) 680–697. - PubMed
    1. Meddis R and Hewitt MJ: Modeling the identification of concurrent vowels with different fundamental frequencies. J Acoust Soc Am. 91 (1992) 233–245. - PubMed

LinkOut - more resources