Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Sep 10;13(9):116.
doi: 10.3390/jintelligence13090116.

Measuring Emotion Perception Ability Using AI-Generated Stimuli: Development and Validation of the PAGE Test

Affiliations

Measuring Emotion Perception Ability Using AI-Generated Stimuli: Development and Validation of the PAGE Test

Ben Weidmann et al. J Intell. .

Abstract

We present a new measure of emotion perception called PAGE (Perceiving AI Generated Emotions). The test includes 20 emotions, expressed by ethnically diverse faces, spanning a wide range of ages. We created stimuli with generative AI, illustrating a method to build customizable assessments of emotional intelligence at relatively low cost. Study 1 describes the validation of the image set and test construction. Study 2 reports the psychometric properties of the test, including convergent validity and relatively strong reliability. Study 3 explores predictive validity using a lab experiment in which we causally identify the contributions managers make to teams. PAGE scores predict managers' causal contributions to group success, a finding which is robust to controlling for personality and demographic characteristics. We discuss the potential of generative AI to automate development of non-cognitive skill assessments.

Keywords: emotion perception; generative AI; management; measurement; teamwork.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure A1
Figure A1
Study 2a: Scree plot of PAGE. The plot displays the eigenvalues for the components extracted with principal component analysis (PCA). The sharp drop between the first and second components indicates a dominant one-factor structure.
Figure 1
Figure 1
Example Stimuli from the PAGE test. In this case, the emotions being represented by the four stimuli are (a) surprise, (b) contentment, (c) anger, (d) fear.
Figure 2
Figure 2
Sample item from the PAGE test. For each item, participants select one answer from six options. Definitions of emotions are provided to participants if they click on the question marks. The target emotion for this item is fear.
Figure 3
Figure 3
Distribution of PAGE scores by gender (N = 505 males and 505 females). Mean score for male is 23.4 (SD = 4.9), Mean score for female is 24.0 (SD = 5.1), mean scores indicated by the vertical lines. The mean difference between males and females is not statistically significant.
Figure 4
Figure 4
Patterns of performance for PAGE/RMET by gender/age/ethnicity (N = 741). 50% of the participants are female, and 50.5% are White. Both tests show similar performance patterns across gender, age and ethnicity. Each dot on the graphs represents one observation.
Figure 5
Figure 5
Experiment overview. Participants were randomly assigned to the role of ‘manager’ or ‘worker’. Each group completed a novel collaborative problem-solving task in which managers assigned tasks, monitored group progress, and motivated teammates. After each task, managers were randomly reassigned to new groups, managing a total of four different groups. The repeated random assignment of managers to teams enabled the identification of each manager’s contribution to group performance.

References

    1. Acheampong Alex, Owusu-Manu De-Graft, Kissi Ernest, Tetteh Portia Atswei. Assessing the Influence of Emotional Intelligence (EI) on Project Performance in Developing Countries: The Case of Ghana. International Journal of Construction Management. 2023;23:1163–73. doi: 10.1080/15623599.2021.1958279. - DOI
    1. Ambadar Zara, Schooler Jonathan W., Cohn Jeffrey F. Deciphering the Enigmatic Face: The Importance of Facial Dynamics in Interpreting Subtle Facial Expressions. Psychological Science. 2005;16:403–10. doi: 10.1111/j.0956-7976.2005.01548.x. - DOI - PubMed
    1. Artsi Yaara, Sorin Vera, Konen Eli, Glicksberg Benjamin S., Nadkarni Girish, Klang Eyal. Large Language Models for Generating Medical Examinations: Systematic Review. BMC Medical Education. 2024;24:354. doi: 10.1186/s12909-024-05239-y. - DOI - PMC - PubMed
    1. Aryadoust Vahid, Zakaria Azrifah, Jia Yichen. Investigating the Affordances of OpenAI’s Large Language Model in Developing Listening Assessments. Computers and Education: Artificial Intelligence. 2024;6:100204. doi: 10.1016/j.caeai.2024.100204. - DOI
    1. Baron-Cohen Simon, Wheelwright Sally, Hill Jacqueline, Raste Yogini, Plumb Ian. The ‘Reading the Mind in the Eyes’ Test Revised Version: A Study with Normal Adults, and Adults with Asperger Syndrome or High-functioning Autism. Journal of Child Psychology and Psychiatry. 2001;42:241–51. doi: 10.1017/s0021963001006643. - DOI - PubMed

LinkOut - more resources