Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 2:12:678712.
doi: 10.3389/fpsyg.2021.678712. eCollection 2021.

Morpho-Phonetic Effects in Speech Production: Modeling the Acoustic Duration of English Derived Words With Linear Discriminative Learning

Affiliations

Morpho-Phonetic Effects in Speech Production: Modeling the Acoustic Duration of English Derived Words With Linear Discriminative Learning

Simon David Stein et al. Front Psychol. .

Abstract

Recent evidence for the influence of morphological structure on the phonetic output goes unexplained by established models of speech production and by theories of the morphology-phonology interaction. Linear discriminative learning (LDL) is a recent computational approach in which such effects can be expected. We predict the acoustic duration of 4,530 English derivative tokens with the morphological functions DIS, NESS, LESS, ATION, and IZE in natural speech data by using predictors derived from a linear discriminative learning network. We find that the network is accurate in learning speech production and comprehension, and that the measures derived from it are successful in predicting duration. For example, words are lengthened when the semantic support of the word's predicted articulatory path is stronger. Importantly, differences between morphological categories emerge naturally from the network, even when no morphological information is provided. The results imply that morphological effects on duration can be explained without postulating theoretical units like the morpheme, and they provide further evidence that LDL is a promising alternative for modeling speech production.

Keywords: acoustic duration; derivation; linear discriminative learning; mental lexicon; morphological theory; speech production.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The handling editor declared a past co-editorship with one of the authors IP.

Figures

Figure 1
Figure 1
Comprehension and production mapping, adapted from Baayen et al. (2019b). For comprehension, transformation matrix F transforms the cue matrix C into the semantic matrix S. For production, transformation matrix G transforms the semantic matrix S into the cue matrix C.
Figure 2
Figure 2
Toy example of an articulatory path for the word lawless. Each connection between a triphone node is assigned a probability of being selected against other triphones.
Figure 3
Figure 3
Effects on duration difference in the standard linear regression models for the Idiosyncratic Network variables (left column), the Morphology Network variables (middle column) and the Base Network variables (right column).
Figure 4
Figure 4
Effects on duration difference in the mixed-effects regression models for the Idiosyncratic Network variables (left column), the Morphology Network variables (middle column) and the Base Network variables (right column). Red regression lines indicate significant effects from the final models, gray regression lines indicate non-significant effects from the initial models before the non-significant predictors were excluded.
Figure 5
Figure 5
Density distributions of variables by derivational function in the Idiosyncratic Network models (left column), the Morphology Network models (middle column), and the Base Network models (right column). Note that in the first two panels in the top tow, the density curves around 1.0 are calculated over a single value.
Figure 6
Figure 6
Type count of top 8 neighbors by network and morphological function.

References

    1. Arnold D., Tomaschek F., Sering K., Lopez F., Baayen R. H. (2017). Words from spontaneous conversational speech can be recognized with human-like accuracy by an error-driven learning algorithm that discriminates between meanings straight from smart acoustic features, bypassing the phoneme as recognition unit. PLoS ONE 12:e0174623. 10.1371/journal.pone.0174623 - DOI - PMC - PubMed
    1. Baayen R. H. (2008). Analyzing Linguistic Data: A Practical Introduction to Statistics Using R. Cambridge: Cambridge University Press.
    1. Baayen R. H., Chuang Y.-Y., Heitmeier M. (2019a). WpmWithLdl: Implementation of Word and Paradigm Morphology With Linear Discriminative Learning. R Package. Available online at: http://www.sfs.uni-tuebingen.de/~hbaayen/software.html
    1. Baayen R. H., Chuang Y.-Y., Shafaei-Bajestan E., Blevins J. P. (2019b). The discriminative lexicon. A unified computational model for the lexicon and lexical processing in comprehension and production grounded not in (de)composition but in linear discriminative learning. Complexity 2019, 1–39. 10.1155/2019/4895891 - DOI
    1. Baayen R. H., Milin P. (2010). Analyzing reaction times. Int. J. Psychol. Res. 3, 12–28. 10.21500/20112084.807 - DOI

LinkOut - more resources