Re-evaluating Theory of Mind evaluation in large language models
- PMID: 40808448
- PMCID: PMC12351311
- DOI: 10.1098/rstb.2023.0499
Re-evaluating Theory of Mind evaluation in large language models
Abstract
The question of whether large language models (LLMs) possess Theory of Mind (ToM)-often defined as the ability to reason about others' mental states-has sparked significant scientific and public interest. However, the evidence as to whether LLMs possess ToM is mixed, and the recent growth in evaluations has not resulted in a convergence. Here, we take inspiration from cognitive science to re-evaluate the state of ToM evaluation in LLMs. We argue that a major reason for the disagreement on whether LLMs have ToM is a lack of clarity on whether models should be expected to match human behaviours, or the computations underlying those behaviours. We also highlight ways in which current evaluations may be deviating from 'pure' measurements of ToM abilities, which also contributes to the confusion. We conclude by discussing several directions for future research, including the relationship between ToM and pragmatic communication, which could advance our understanding of artificial systems as well as human cognition.This article is part of the theme issue 'At the heart of human communication: new views on the complex relationship between pragmatics and Theory of Mind'.
Keywords: Theory of Mind; cognitive science; language models; pragmatics.
Conflict of interest statement
We declare we have no competing interests.
References
-
- Premack D, Woodruff G. 1978. Does the chimpanzee have a theory of mind? Behav. Brain Sci. 1, 515–526. ( 10.1017/s0140525x00076512) - DOI
-
- Apperly I. 2011. Mindreaders: the cognitive basis of ‘theory of mind.’ New York, NY: Psychology Press.
-
- Clark HH, Marshall CR. 1981. Definite reference and mutual knowledge. In Elements of discourse understanding (eds Joshi A, Webber B, Sag I), pp. 10–63. Cambridge, UK: Cambridge University Press.
-
- Brennan SE, Galati A, Kuhlen AK. 2010. Two minds, one dialog: coordinating speaking and understanding. In Psychology of learning and motivation (ed. Ross BH), pp. 301–344, vol. 53. Elsevier. ( 10.1016/S0079-7421(10)53008-1) - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources