Spoken word production: a theory of lexical access

W J Levelt¹

Affiliations

PMID: 11698690
PMCID: PMC60894
DOI: 10.1073/pnas.231459498

Spoken word production: a theory of lexical access

W J Levelt. Proc Natl Acad Sci U S A. 2001.

. 2001 Nov 6;98(23):13464-71.

doi: 10.1073/pnas.231459498.

Author

W J Levelt¹

Affiliation

¹ Max Planck Institute for Psycholinguistics, P.O. Box 310, 6500 AH, Nijmegen, The Netherlands. pim@mpi.nl

PMID: 11698690
PMCID: PMC60894
DOI: 10.1073/pnas.231459498

Abstract

A core operation in speech production is the preparation of words from a semantic base. The theory of lexical access reviewed in this article covers a sequence of processing stages beginning with the speaker's focusing on a target concept and ending with the initiation of articulation. The initial stages of preparation are concerned with lexical selection, which is zooming in on the appropriate lexical item in the mental lexicon. The following stages concern form encoding, i.e., retrieving a word's morphemic phonological codes, syllabifying the word, and accessing the corresponding articulatory gestures. The theory is based on chronometric measurements of spoken word production, obtained, for instance, in picture-naming tasks. The theory is largely computationally implemented. It provides a handle on the analysis of multiword utterance production as well as a guide to the analysis and design of neuroimaging studies of spoken utterance production.

PubMed Disclaimer

Figures

**Figure 1**
Serial two-system architecture of the theory: two stages of lexical selection followed by three stages of form encoding.

**Figure 2**
Fragment of the WEAVER++ lexical selection network. (*Upper* stratum) Lexical concept nodes. (*Lower* stratum) Lemma and gender nodes.

**Figure 3**
Fragment of the WEAVER++ form encoding network (*Left*) with corresponding form-processing stages (*Right*). (*Upper* stratum) Nodes representing morphemic phonological codes and their phonemic “spellouts.” (*Lower* stratum) Nodes representing syllabic articulatory scores.

**Figure 4**
(A) Experimental visual pattern to be described. Pattern is flat on the table in front of the subject. Description starts from the arrow. On the outside, directional terms used in “intrinsic” perspective. On the inside, terms used in ”relative” perspective. (B) A typical subject's drawing of the same pattern from listening to an intrinsic description.

**Figure 5**
Effects of a semantic distracter on picture-naming (A) and picture categorization latencies (B) for different SOAs of picture and visual distracter words. The semantic effect is the naming latency when the distracter is semantically related to the picture name minus the latency when the distracter word is unrelated to the picture name. Black squares represent data from Glaser and Düngelhoff (21). Open squares represent WEAVER++ simulations (6).

**Figure 6**
Additive effects of semantic competition and gender facilitation in lexical selection. The target utterance, a Dutch adjective-noun phrase, is produced in response to an English probe word. The target nouns in an experimental block are either semantically homogeneous or heterogeneous. The syntactic gender of the target nouns in a block is either homogeneous (all gender 1 or all gender 2) or heterogeneous (both genders mixed). Data is from Vigliocco *et al.* (28).

**Figure 7**
Effects of a phonological distracter on picture-naming latencies, for different SOAs between picture onset and onset of spoken distracter. All picture names and distracter words were disyllabic. (A) Distracter and target words share their first syllables. (B) Distracter and target words share their last syllables. The phonological effect is the naming latency when the distracter is phonologically related to the target name minus the naming latency when the distracter is unrelated. Black squares represent data from Meyer and Schriefers (36). Open squares represent WEAVER++ simulations (7).

**Figure 8**
Visual scan during a scene description. As soon as the two-object picture appears, the subject makes a saccade from the fixation point to the left object. After some scanning of that object, the gaze shifts to the right one. VT is the total duration of fixating on an object. The utterance produced here is “baby and dog.”

**Figure 9**
A metaanalysis of 58 neuroimaging studies of word production. Colors denote regions whose activations correspond to theoretical processing stages as indicated. Contributions of insula, subcortical regions, and cerebellum are not shown. [Reproduced with permission from ref. 64 (Copyright 2000, MIT Press, Cambridge, MA).]

See this image and copyright information in PMC

References

1. Wundt W. Die Sprache. Leipzig, Germany: Engelmann; 1900.
1. Meringer R, Mayer K. Versprechen und Verlesen. Stuttgart: Goschen; 1896.
1. Levelt W J M. Trends Cogn Sci. 1999;3:223–232. - PubMed
1. Levelt W J M. Speaking: From Intention to Articulation. Cambridge, MA: MIT Press; 1989.
1. Miller G A. The Science of Words. New York: Scientific American; 1991.

MeSH terms

Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Spoken word production: a theory of lexical access

Affiliation

Spoken word production: a theory of lexical access

Author

Affiliation

Abstract

Figures

References

MeSH terms

LinkOut - more resources

Full Text Sources