Overlap in meaning is a stronger predictor of semantic activation in GPT-3 than in humans
- PMID: 36977744
- PMCID: PMC10050205
- DOI: 10.1038/s41598-023-32248-6
Abstract
Modern large language models generate texts that are virtually indistinguishable from those written by humans and achieve near-human performance in comprehension and reasoning tests. Yet, their complexity makes it difficult to explain and predict their functioning. We examined a state-of-the-art language model (GPT-3) using lexical decision tasks widely used to study the structure of semantic memory in humans. The results of four analyses showed that GPT-3's patterns of semantic activation are broadly similar to those observed in humans, showing significantly higher semantic activation in related (e.g., "lime-lemon") word pairs than in other-related (e.g., "sour-lemon") or unrelated (e.g., "tourist-lemon") word pairs. However, there are also significant differences between GPT-3 and humans. GPT-3's semantic activation is better predicted by similarity in words' meaning (i.e., semantic similarity) than by their co-occurrence in language (i.e., associative similarity). This suggests that GPT-3's semantic network is organized around words' meanings rather than their co-occurrence in text.
© 2023. The Author(s).
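To illustrate the distinction the abstract draws between the two predictors, the sketch below contrasts a semantic-similarity proxy (cosine similarity between word embeddings) with an associative-similarity proxy (sentence-level co-occurrence counts) for the example word pairs. This is not the authors' analysis pipeline: the random placeholder embeddings and the toy corpus are illustrative assumptions only; in a study-style analysis the embeddings would come from a language model and the counts from a large corpus.

```python
# Minimal sketch (not the authors' code): two similarity proxies for word pairs.
import numpy as np
from itertools import combinations
from collections import Counter

def cosine(u, v):
    """Cosine similarity between vectors -- a common proxy for semantic similarity."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def cooccurrence_counts(sentences):
    """Count how often two words appear in the same sentence --
    a crude proxy for associative similarity."""
    counts = Counter()
    for sentence in sentences:
        tokens = set(sentence.lower().split())
        for a, b in combinations(sorted(tokens), 2):
            counts[(a, b)] += 1
    return counts

# Placeholder embeddings; a real analysis would use vectors from a language model.
rng = np.random.default_rng(0)
emb = {w: rng.normal(size=8) for w in ["lime", "lemon", "sour", "tourist"]}

# Toy corpus standing in for a large text collection.
toy_corpus = [
    "the lemon tasted sour",
    "a tourist bought a lemon and a lime",
    "lime and lemon are citrus fruits",
]
co = cooccurrence_counts(toy_corpus)

# Compare the two measures on the related / other-related / unrelated pairs.
for pair in [("lime", "lemon"), ("sour", "lemon"), ("tourist", "lemon")]:
    key = tuple(sorted(pair))
    print(pair,
          "semantic:", round(cosine(emb[pair[0]], emb[pair[1]]), 3),
          "associative:", co[key])
```

With real model embeddings and a real corpus, each pair would get both a semantic and an associative score, and the study's question is which of the two better predicts the model's (or humans') semantic activation on that pair.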
Conflict of interest statement
The authors declare no competing interests.
