Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Nov 1;41(6):10.1016/j.wocn.2013.09.003.
doi: 10.1016/j.wocn.2013.09.003.

Auditory free classification of nonnative speech

Affiliations

Auditory free classification of nonnative speech

Eriko Atagi et al. J Phon. .

Abstract

Through experience with speech variability, listeners build categories of indexical speech characteristics including categories for talker, gender, and dialect. The auditory free classification task-a task in which listeners freely group talkers based on audio samples-has been a useful tool for examining listeners' representations of some of these characteristics including regional dialects and different languages. The free classification task was employed in the current study to examine the perceptual representation of nonnative speech. The category structure and salient perceptual dimensions of nonnative speech were investigated from two perspectives: general similarity and perceived native language background. Talker intelligibility and whether native talkers were included were manipulated to test stimulus set effects. Results showed that degree of accent was a highly salient feature of nonnative speech for classification based on general similarity and on perceived native language background. This salience, however, was attenuated when listeners were listening to highly intelligible stimuli and attending to the talkers' native language backgrounds. These results suggest that the context in which nonnative speech stimuli are presented-such as the listeners' attention to the talkers' native language and the variability of stimulus intelligibility-can influence listeners' perceptual organization of nonnative speech.

Keywords: classification; foreign-accented speech; native language background; perceptual similarity.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Two-dimensional MDS solutions for the Multi-sentence (A) and Same-sentence (B) conditions in which listeners grouped talkers by overall perceived similarity. Each point on the MDS solution represents a talker and is labeled with a unique talker ID as indicated in Appendix A. Each talker ID includes information about the talker’s native language background (indicated with the first letter of the language name). Male talkers are indicated by open triangles; filled circles indicate female talkers.
Figure 2
Figure 2
Classification performance by perceived native language background across all four conditions (Hi-NonNatOnly, Lo-NonNatOnly, Hi-NativeIncl, Lo-NativeIncl), in difference scores. “Hi/Lo” indicate the condition’s overall intelligibility level; “NativeIncl” conditions are where native talkers were included, and “NonNatOnly” conditions are when the stimuli set consisted only of nonnative talkers. Error bars indicate standard error.
Figure 3
Figure 3
Classification matrices indicating rates of correct pairings (cells on the diagonal) and of incorrect pairings (cells off of the diagonal) between each of the native language backgrounds presented to listeners in each condition. Each cell indicates the rate at which talkers with the two language backgrounds of the corresponding row and column were classified together. Each cell contains a score between 0 and 100, where 0 indicates that no talker of a given native language background was paired with any talker from the other native language background. The diagonal indicates rates of correct native language grouping (i.e., “hits”); the off-diagonal cells are the rates of incorrect pairings (i.e., “false alarms”). An off-diagonal score of 100 would indicate that all listeners grouped all four talkers from one native language background with all four talkers from the other native language background. Numbers in bold indicate native language pairings that were numerically above chance (i.e., greater than 16.7 for Hi-NonNatOnly and Lo-NonNatOnly; greater than 14.3 for Hi-NativeIncl and Lo-NativeIncl).
Figure 4
Figure 4
Two-dimensional MDS solutions for four conditions in Experiment 2, in which listeners grouped talkers by perceived native language background. Each point on the MDS solution represents a talker and is labeled with a unique talker ID as indicated in Appendix A. Each talker ID includes information about the talker’s native language background (indicated with the first letter of the language name). Male talkers are indicated by open triangles; filled circles indicate female talkers.
Figure 5
Figure 5
ADDTREE clustering solutions across all four conditions.

References

    1. Atagi E, Bent T. Perceptual dimensions of nonnative speech. In: Lee W-S, Zee E, editors. International Congress of Phonetic Sciences XVII. Hong Kong: City University of Hong Kong; 2011. pp. 260–263.
    1. Bent T. Native and non-native speech database for children. Journal of the Acoustical Society of America. 2010;127:1905.
    1. Best CT, McRoberts GW, Goodell E. Discrimination of non-native consonant contrasts varying in perceptual assimilation to the listener’s native phonological system. Journal of the Acoustical Society of America. 2001;109:775–794. - PMC - PubMed
    1. Bradlow AR, Clopper CG, Smiljanic R, Walter MA. A perceptual phonetic similarity space for languages: Evidence from five native language listener groups. Speech Communication. 2010;52:930–942. - PMC - PubMed
    1. Clopper CG. Doctoral dissertation. Indiana University Bloomington; 2004. Linguistic experience and the perceptual classification of dialect variation.

LinkOut - more resources