Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 14:7:e20.
doi: 10.1017/ehs.2025.10011. eCollection 2025.

Quantifying and explaining the rise of fiction

Affiliations

Quantifying and explaining the rise of fiction

Edgar Dubourg et al. Evol Hum Sci. .

Abstract

We present a comprehensive analysis of the rise of fictions across human narratives, using large-scale datasets that collectively span over 65,000 works across various media (movies, literary works), cultures (over 30 countries, Western and non-Western), and time periods (2000 BCE to 2020 CE). We measured fictiveness - defined as the degree of departure from reality - across three narrative dimensions: protagonists, events, and settings. We used automatic annotations from large language models (LLMs) to systematically score fictiveness and ensured the robustness and validity of our measure, specifically by demonstrating predictable variations in fictiveness across different genres, in all media. Statistical analyses of the changes in fictiveness over time revealed a steady increase, culminating in the 20th and 21st centuries, across all narrative forms. Remarkably, this trend is also evident in our data spanning ancient times: fictiveness increased gradually in narratives dating back as far as 2000 BCE, with notable peaks of fictiveness during affluent periods such as the heights of the Roman Empire, the Tang Dynasty, and the European Renaissance. We explore potential psychological explanations for the rise in fictiveness, including changing audience preferences driven by ecological and social changes.

Keywords: cultural ecology; cultural evolution; fiction.

PubMed Disclaimer

Conflict of interest statement

Authors declare no conflict of interest.

Figures

None
Graphical abstract
Figure 1.
Figure 1.
Data extraction and annotation process, with the minimal version of the scale (see SM for the full scales and the prompt).
Figure 2.
Figure 2.
Examples of films (from IMDb) and literary works (from Babel), alongside their fictiveness scores for each referent, the overall (averaged) fictiveness score, and a selected excerpt from GPT’s generated output for one chosen referent (indicated by the colour of the text).
Figure 3.
Figure 3.
(A) Comparisons of fictiveness scores across referents in all five datasets. (B) Comparisons of fictiveness across datasets. (C) Comparisons of fictiveness across genres in four datasets where genres were available. In each graph, genres are ordered from higher to lower average fictiveness. For displaying significance, the overall fictiveness of adjacent genres is compared using a t-test (see SM for full statistics).
Figure 4.
Figure 4.
Correlations between personality traits and socio-demographic characteristics of the audiences who ‘liked’ movies on Facebook (N = 3.5 million), in function of the fictiveness of the movies (N = 690 movies).
Figure 5.
Figure 5.
(A) Evolution of fictiveness across time in IMDb. (B) Evolution of fictiveness across time and languages in Babel (with varying y-axis scaling).
Figure 6.
Figure 6.
(A) Forest plot of standardized regression coefficients predicting worldwide gross income from budget, duration, year, fictiveness, and the interaction of fictiveness and year. Points represent standardized effect sizes; horizontal lines show 95% confidence intervals. Model 1 includes budget, duration, and year (green); Model 2 adds fictiveness (orange); Model 3 adds the interaction between fictiveness and year (purple). (B) Plot of the interaction effect of year and fictiveness on gross income. Top: predicted values from the regression model. Bottom: actual values, displaying the distribution of log worldwide gross income over time across three bins of fictiveness (with regression lines representing linear model fits between year of release and log worldwide gross income for each bin of fictiveness across time).
Figure 7.
Figure 7.
Evolution of fictiveness across linguistic regions. Note that in our analysis, we used a linear model to capture the overall trend of increasing or decreasing fictiveness over time. However, in these graphs, we present a LOESS regression line, which provides a smoother visualization, allowing for the exploration of more fine-grained variations in the data.

References

    1. Abdurahman, S., Atari, M., Karimi-Malekabadi, F., Xue, M. J., Trager, J., Park, P. S., Golazizian, P., Omrani, A., & Dehghani, M. (2023). Perils and opportunities in using large language models in psychological research. OSF Preprints. 10.31219/osf.io/tg79n - DOI - PMC - PubMed
    1. Agapītós, P. A., & Mortensen, L. B. (2012). Medieval narratives between history and fiction: From the centre to the periphery of Europe, c. 1100–1400. Museum Tusculanum Press [distributor].
    1. Aikhenvald, A. Y. (2004). Evidentiality. Oxford University Press.
    1. Altay, S., Hacquin, A.-S., & Mercier, H. (2020). Why do so few people share fake news? It hurts their reputation. New Media & Society, 22, 1–22. 10.1177/1461444820969893 - DOI
    1. André, J.-B., Debove, S., Fitouchi, L., & Baumard, N. (2022). Moral cognition as a Nash product maximizer: An evolutionary contractualist account of morality. Preprint. PsyArXiv. 10.31234/osf.io/2hxgu - DOI

LinkOut - more resources