Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 24;122(25):e2319933121.
doi: 10.1073/pnas.2319933121. Epub 2025 Jun 16.

Perceptual interventions ameliorate statistical discrimination in learning agents

Affiliations

Perceptual interventions ameliorate statistical discrimination in learning agents

Edgar A Duéñez-Guzmán et al. Proc Natl Acad Sci U S A. .

Abstract

Choosing social partners is a potentially demanding task which involves paying attention to the right information while disregarding salient but possibly irrelevant features. The resultant trade-off between cost of evaluation and quality of decisions can lead to undesired bias. Information-processing abilities mediate this trade-off, where individuals with higher ability choose better partners leading to higher performance. By altering the salience of features, technology can modulate the effect of information-processing limits, potentially increasing or decreasing undesired biases. Here, we use game theory and multiagent reinforcement learning to investigate how undesired biases emerge, and how a technological layer (in the form of a perceptual intervention) between individuals and their environment can ameliorate such biases. Our results show that a perceptual intervention designed to increase the salience of outcome-relevant features can reduce bias in agents making partner choice decisions. Individuals learning with a perceptual intervention showed less bias due to decreased reliance on features that only spuriously correlate with behavior. Mechanistically, the perceptual intervention effectively increased the information-processing abilities of the individuals. Our results highlight the benefit of using multiagent reinforcement learning to model theoretically grounded social behaviors, particularly when real-world complexity prohibits fully analytical approaches.

Keywords: partner choice; perceptual interventions; reinforcement learning; statistical discrimination.

PubMed Disclaimer

Conflict of interest statement

Competing interests statement:The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
The stages of iterative partner choice: During free mixing, individuals have access to observable features, but not directly their suitability as partners. Individuals form pairs based on what information is available to them and proceed to interact. The outcome of the interaction is a pair of payoffs (or rewards) assigned to the individuals in each of the pairs.
Fig. 2.
Fig. 2.
The boat race environment. (A) Six players (teal and purple) pair up to row boats (brown) and cross the river to access apples (green) that confer rewards. Agents can move freely on the river banks (black) but cannot walk across the river (water sprites in blue). Access to the boats is gated by barriers (gray). On the Left, we see a frame from before a race starts (semaphores are red) and agents are able to move freely. On the Right, we see a frame from after the race has started. The semaphores have turned green, and the barriers lifted to provide access to the boats. (B) Avatars are assigned a unique random “badge” at the beginning of an episode so their identity can be known throughout the episode.
Fig. 3.
Fig. 3.
Focal agent’s discrimination index D in their evaluation communities. Positive values indicate choosing partners based on their behavior, negative ones indicate choosing based on their color. (A) D7 in evaluation communities as a function of training bias. The higher the training bias, the worse the discrimination index is. On the Left, the baseline shows negative D7 overall. On the Right, the strongest perceptual intervention shows positive D7 overall. (B) D averaged over training biases is shown for the experimental conditions: no memory, baseline, and four levels of decay β of the perceptual intervention. Stronger perceptual interventions (those with less decay) result in improved D7. CIs are computed by bootstrapping each agent’s discrimination index (5,000 samples).
Fig. 4.
Fig. 4.
(A and B) The empirical values of awareness and stickiness for the focal agents, respectively. (C) The discrimination index for the analytical model, computed from Pj(8). Each figure shows a contour plot of the average discrimination index across all color biases (B) in terms of the stickiness s (on the x-axis) and awareness ω (on the y-axis). The empirical stickiness and awareness of the baseline agents and those trained with perceptual interventions are shown overlaid on the graph. (D) The average episode reward of the focal agents. Error bars correspond to SE.

Similar articles

Cited by

References

    1. Barclay P., Strategies for cooperation in biological markets, especially for humans. Evol. Hum. Behav. 34, 164–175 (2013).
    1. Brosnan S. F., De Waal F. B. M., A proximate perspective on reciprocal altruism. Hum. Nat. 13, 129–152 (2002). - PubMed
    1. Schino G., Aureli F., Reciprocal altruism in primates: Partner choice, cognition, and emotions. Adv. Study Behav. 39, 45–69 (2009).
    1. Schino G., Aureli F., Primate reciprocity and its cognitive requirements. Evol. Anthropol. Issues News Rev. 19, 130–135 (2010).
    1. Greenwald A. G., Banaji M. R., Implicit social cognition: Attitudes, self-esteem, and stereotypes. Psychol. Rev. 102, 4 (1995). - PubMed

LinkOut - more resources