Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Dec 5;120(49):e2309350120.
doi: 10.1073/pnas.2309350120. Epub 2023 Nov 30.

Uncovering the semantics of concepts using GPT-4

Affiliations

Uncovering the semantics of concepts using GPT-4

Gaël Le Mens et al. Proc Natl Acad Sci U S A. .

Abstract

The ability of recent Large Language Models (LLMs) such as GPT-3.5 and GPT-4 to generate human-like texts suggests that social scientists could use these LLMs to construct measures of semantic similarity that match human judgment. In this article, we provide an empirical test of this intuition. We use GPT-4 to construct a measure of typicality-the similarity of a text document to a concept. We evaluate its performance against other model-based typicality measures in terms of the correlation with human typicality ratings. We conduct this comparative analysis in two domains: the typicality of books in literary genres (using an existing dataset of book descriptions) and the typicality of tweets authored by US Congress members in the Democratic and Republican parties (using a novel dataset). The typicality measure produced with GPT-4 meets or exceeds the performance of the previous state-of-the art typicality measure we introduced in a recent paper [G. Le Mens, B. Kovács, M. T. Hannan, G. Pros Rius, Sociol. Sci. 2023, 82-117 (2023)]. It accomplishes this without any training with the research data (it is zero-shot learning). This is a breakthrough because the previous state-of-the-art measure required fine-tuning an LLM on hundreds of thousands of text documents to achieve its performance.

Keywords: LLM; categories; chatGPT; deep learning; typicality.

PubMed Disclaimer

Conflict of interest statement

Competing interests statement:The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
Using GPT-4 to measure the genre typicality of a book based on its description: The aggregate typicality measure produced with GPT-4 is highly correlated with the average of human typicality ratings. Leftmost panel: All books in the test data for the Mystery genre. Center Left panel: Separate plots for Mystery and Non-Mystery books. Center Right panel: All books in the test data for the Romance genre. Rightmost panel: Separate plots for Romance and Non-Romance books.
Fig. 2.
Fig. 2.
Using GPT-4 to measure the typicality of a tweet in a political party: The aggregate typicality measure produced with GPT-4 is highly correlated with the average of human typicality ratings. Leftmost panel: All tweets in the test data for typicality in the Democratic Party. Center left panel: Separate plots for tweets by Democratic and Republican Congress members. Center right panel: All tweets in the test data for typicality in the Republican Party. Rightmost panel: Separate plots for tweets by Democratic and Republican Congress members.

References

    1. Hsu G., Jacks of all trades and masters of none: Audiences’ reactions to spanning genres in feature film production. Adm. Sci. Q. 51, 420–450 (2006).
    1. Leahey E., Not by productivity alone: How visibility and specialization contribute to academic earnings. Am. Sociol. Rev. 72, 533–561 (2007).
    1. Leung M. D., Dilettante or renaissance man? How the order of job experiences affects perceptions of ability in an external labor market Am. Sociol. Rev. 79, 136–158 (2014).
    1. Uzzi B., Mukherjee S., Stringer M., Jones B., Atypical combinations and scientific impact. Science 342, 468–472 (2013). - PubMed
    1. Rosch E. H., Cognitive representations of semantic categories. J. Exp. Psychol.: General 104, 192–233 (1975).

LinkOut - more resources