Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb;53(1):247-263.
doi: 10.3758/s13428-020-01440-1.

What is semantic diversity and why does it facilitate visual word recognition?

Affiliations

What is semantic diversity and why does it facilitate visual word recognition?

Benedetta Cevoli et al. Behav Res Methods. 2021 Feb.

Abstract

Previous research has speculated that semantic diversity and lexical ambiguity may be closely related constructs. Our research sought to test this claim in respect of the semantic diversity measure proposed by Hoffman et al. (2013). To this end, we replicated the procedure described by Hoffman et al., Behavior Research Methods, 45(3), 718-730 (2013) for computing multidimensional representations of contextual information using Latent Semantic Analysis, and from these we derived semantic diversity values for 28,555 words. We then replicated the facilitatory effect of semantic diversity on word recognition using existing data resources and observed this effect to be greater for low-frequency words. Yet, we found no relationship between this measure and lexical ambiguity effects in word recognition. Further analysis of the LSA-based contextual representations used to compute Hoffman et al. (2013) measure of semantic diversity revealed that they do not capture the distinct meanings of ambiguous words. Instead, these contextual representations appear to capture general information about the topics and types of written material in which words occur. These analyses suggest that the semantic diversity metric previously proposed by Hoffman et al. (2013) facilitates word recognition because high-diversity words are likely to have been encountered no matter what one has read, whereas many participants may not have encountered lower-diversity words simply because the topics and types of written material in which they occur are more restricted.

Keywords: Latent semantic analysis; Lexical ambiguity; Semantic diversity; Word frequency.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Illustration of semantic diversity procedure
Fig. 2
Fig. 2
Scatter plots of resulting semantic diversity measures on x-axes and the norms reported by Hoffman et al. (2013) and the y-axes. On the left, values obtained following the preprocessing procedure described in the methods (lemmatised corpus, exclusion of stop words, etc.), while on the right, values obtained following Hoffman et al. (2013) preprocessing procedure. On the top row (blue) are the measures obtained with the classical output of LSA (weighting by the singular values), while on the bottom row (grey) are the measures obtained without considering the singular values
Fig. 3
Fig. 3
Model estimates of the effect of semantic diversity by frequency on reaction time data as a function of database
Fig. 4
Fig. 4
Results of the simulation analysis of Experiment 1 of Rodd et al. (2002) on reaction time data of BLP and ELP. Both datasets show that increasing number of senses speeds performance, while increasing number of meanings slows performance
Fig. 5
Fig. 5
Results of the simulation analysis of Experiment 1 of Rodd et al. (2002) showing no difference in semantic diversity for words with many or few senses and meanings
Fig. 6
Fig. 6
Results of the simulation analysis of Experiment 2 of Rodd et al. (2002) on response time data from the BLP and ELP. Data show that an increased number of senses speeds lexical decision latency, but that there is no effect of the number of meanings
Fig. 7
Fig. 7
Results of the simulation analysis of Experiment 2 of Rodd et al. (2002) showing no difference in semantic diversity values for words with many or few senses or meanings
Fig. 8
Fig. 8
Descriptive bar plots of response time data (left) by type of ambiguity (pooled between all experimental conditions) as reported by Armstrong & Plaut (2016) and bar plots of replication analysis of BLP and ELP (middle and right, respectively) showing a polysemy advantage but no homonymy disadvantage
Fig. 9
Fig. 9
Results of the simulation analysis of Armstrong & Plaut (2016) on semantic diversity measures showing no difference across ambiguity type
Fig. 10
Fig. 10
t-SNE plots of the context vectors in which the word calf occurs
Fig. 11
Fig. 11
t-SNE plots of the context vectors in which the word mole occurs
Fig. 12
Fig. 12
t-SNE plots of the context vectors in which the word pupil occurs
Fig. 13
Fig. 13
t-SNE plots of the whole corpus labelled by domain on the top (variance ratio: 320.74), while on the bottom are the same labels randomly assigned for comparison (M = 1.00, SD = 0.04 for 1000 iterations)
Fig. 14
Fig. 14
t-SNE plots of the whole corpus labelled by type of written material on the top (variance ratio: 301.12), while on the bottom are the same labels randomly assigned for comparison (M = 1.00, SD = 0.05 for 1000 iterations)

Comment in

References

    1. Adelman JS, Brown GDA, Quesada JF. Contextual diversity, not word frequency, determines word-naming and lexical decision times. Psychological Science. 2006;17(9):814–823. doi: 10.1111/j.1467-9280.2006.01787.x. - DOI - PubMed
    1. Armstrong BC, Plaut DC. Disparate semantic ambiguity effects from semantic processing dynamics rather than qualitative task differences. Language, Cognition and Neuroscience. 2016;31(7):940–966. doi: 10.1080/23273798.2016.1171366. - DOI
    1. Baayen RH. Corpus linguistics and naive discriminative learning. Revista Brasileira de Linguística Aplicada. 2011;11(2):295–328. doi: 10.1590/s1984-63982011000200003. - DOI
    1. Baayen, R. H., Chuang, Y.-Y., Shafaei-Bajestan, E., & Blevins, J. P. (2019). The Discriminative Lexicon: A Unified Computational Model for the Lexicon and Lexical Processing in Comprehension and Production Grounded Not in (De)Composition but in Linear Discriminative Learning. Hindawi. Com. 10.1155/2019/4895891
    1. Baayen RH, Milin P. Analyzing reaction times. International Journal of Psychological Research. 2010;3(2):12. doi: 10.21500/20112084.807. - DOI

LinkOut - more resources