Cogn Res Princ Implic. 2022 Feb 8;7(1):14. doi: 10.1186/s41235-022-00364-y.

Adapting to the algorithm: how accuracy comparisons promote the use of a decision aid


Garston Liang et al. Cogn Res Princ Implic. 2022.

Abstract

In three experiments, we sought to understand when and why people use an algorithmic decision aid. Distinct from recent approaches, we explicitly enumerated the algorithm's accuracy while also providing summary feedback and training that allowed participants to assess their own skills. Our results highlight that such direct performance comparisons between the algorithm and the individual encourage a strategy of selective reliance on the decision aid: individuals ignored the algorithm when the task was easier and relied on it when the task was harder. Our systematic investigation of summary feedback, training experience, and strategy hint manipulations shows that further opportunities to learn about the algorithm encourage not only increased reliance on the algorithm but also experimentation with and verification of its recommendations. Together, our findings emphasize the decision-maker's capacity to learn about the algorithm, providing insights into how the use of decision aids can be improved.
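To make the selective-reliance strategy concrete, here is a minimal simulation sketch. The 70% algorithm accuracy matches the value reported in the figure captions below; the participant accuracy values, trial counts, and function names are hypothetical, chosen only to illustrate the easier-versus-harder contrast, not taken from the paper.

    import random

    ALGORITHM_ACCURACY = 0.70          # stated accuracy of the decision aid
    OWN_ACCURACY = {"easy": 0.85,      # hypothetical unaided accuracy, easier trials
                    "hard": 0.55}      # hypothetical unaided accuracy, harder trials

    def respond(difficulty: str) -> bool:
        """Return True if the simulated response on one trial is correct.

        Selective reliance: ignore the algorithm when one's own accuracy
        exceeds the algorithm's, and defer to the algorithm otherwise.
        """
        if OWN_ACCURACY[difficulty] >= ALGORITHM_ACCURACY:
            return random.random() < OWN_ACCURACY[difficulty]   # go it alone
        return random.random() < ALGORITHM_ACCURACY             # defer to aid

    trials = ["easy"] * 500 + ["hard"] * 500
    accuracy = sum(respond(d) for d in trials) / len(trials)
    print(f"overall accuracy under selective reliance: {accuracy:.2f}")

Under these assumed values, selective reliance outperforms both always ignoring and always following the aid, which is the intuition behind the strategy described in the abstract.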

Keywords: Algorithm; Decision aid; Feedback.


Conflict of interest statement

The authors have no competing interests to declare.

Figures

Fig. 1
Examples of dot motion stimuli with the algorithm. In the task, a proportion of dots move coherently along the 90°–270° axis, and participants judge the direction of dot movement along this axis as either left-motion or right-motion (orange arrows). Distractor dots move in straight lines but along different axes (grey arrows). Note that the orange and grey arrows appear here for illustrative purposes only; they were never shown to participants in any experiment. Panel A shows the algorithm’s recommendation (left green arrow) above the stimulus. In Experiments 1a and 1b, the recommendation appeared above the stimulus automatically. In Experiments 2 and 3, the algorithm appeared on screen as a green box (Panel C) unless the participant requested a recommendation. If requested, the algorithm loaded the recommendation during a one-second delay, displaying a “loading circle” that revolved around the box (Panel B). All motion in the stimulus stopped during this loading time and resumed once the recommendation was revealed (as in Panel A)
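As a rough illustration of how one trial of this task might be generated, the sketch below samples a true motion direction and a recommendation that is correct 70% of the time (the algorithm's accuracy reported in the captions below). The coherence value and dot count are hypothetical; the paper's actual stimulus parameters are not given in this caption.

    import random

    def make_trial(coherence: float) -> dict:
        """Sample one dot-motion trial plus the algorithm's recommendation.

        `coherence` is the proportion of dots moving coherently along the
        90-270 degree axis; the remaining (distractor) dots move in straight
        lines along other axes. The recommendation matches the true
        direction with probability 0.70, the algorithm's stated accuracy.
        """
        true_direction = random.choice(["left", "right"])
        n_dots = 100                                  # hypothetical dot count
        n_signal = round(coherence * n_dots)
        if random.random() < 0.70:                    # algorithm is correct
            recommendation = true_direction
        else:
            recommendation = "left" if true_direction == "right" else "right"
        return {"direction": true_direction,
                "signal_dots": n_signal,
                "distractor_dots": n_dots - n_signal,
                "recommendation": recommendation}

    print(make_trial(coherence=0.4))

Lower coherence values would correspond to the harder stimuli described in the experiments, since fewer dots carry the signal direction.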
Fig. 2
Test block progression in Experiments 1a and 1b. The difficulty of the test stimuli was randomized within each block of trials. The white proportion denotes the proportion of trials on which the arrow/cancer cue appeared. In the control block, the cue never appeared; in the cued block, the cue appeared on a random half of the trials. Although the figure may appear to show cue-absent trials in the first half of the block, the order of cued and cue-absent trials was randomized within the block
Fig. 3
Proportion of correct responses during the test stage as a function of trial type and stimulus difficulty. Control trials, on which the cue was never presented, are compared against cue-present trials in the cued block. Boxplots display medians and interquartile ranges, with dots representing individual participants. The green intercept line represents the algorithm’s performance (i.e., 70% correct)
Fig. 4
Performance in the training blocks (Panel A) and test stage (Panel B). Mean proportion correct is presented as a function of stimulus difficulty. Boxplots show medians and interquartile ranges, with dots representing outliers. Note that in training, stimuli were blocked by difficulty; for example, participants in the easy-hard training condition completed the easier training block followed by the harder training block. At test, difficulty was randomized for all conditions. The horizontal line in Panel B represents the algorithm’s performance level of 70% correct. See online for colour version
Fig. 5
Panel A shows the mean percentage of algorithm-request trials as a function of condition and stimulus difficulty. Boxplots show medians and interquartile ranges. Panel B shows mean proportion correct with the recommendation on algorithm-requested trials compared to unassisted “own decision” trials as a function of stimulus difficulty. See online for colour version
Fig. 6
Panel A presents the mean proportion correct as a function of difficulty. Intercept lines represent the minimum incentive thresholds: an 80% average on the easier images earned $2.00, and reaching 65% correct in two out of six blocks of harder images earned an additional $3.00. Panel B presents the percentage of algorithm requests. Boxplots display medians and interquartile ranges. As a reminder, the Hint condition (green) received a description of the maximization strategy and an algorithm-performance comparison during block feedback
Fig. 7
Panel A presents the correlation between algorithm requests and performance for the harder stimuli. Panel B is a histogram of the number of harder-stimuli blocks in which participants reached the high-performance threshold of 65% correct. Darker shading indicates participants who received the additional payment (i.e., reached threshold in at least two blocks); lighter shading indicates no payment
Fig. 8
Algorithm agreement plotted as a function of whether the recommendation was correct (x-axis) and stimulus difficulty. Data are grouped by condition from Experiments 2 and 3. Agreement measures the proportion of trials on which the participant made the same response as recommended. Note that agreement data exist only for trials on which the recommendation was requested. Boxplots display medians and interquartile ranges, with dots displaying individual participant data. See online for colour version
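The agreement measure in this caption can be computed directly from a trial log. The sketch below assumes a hypothetical log format (it is not the authors' analysis code); as in the caption, only trials with a requested recommendation contribute to agreement.

    # Each tuple: (requested, response, recommendation, recommendation_correct)
    trials = [
        (True,  "left",  "left",  True),
        (True,  "right", "left",  False),
        (False, "left",  None,    None),   # no request: excluded from agreement
        (True,  "right", "right", True),
    ]

    # Keep only trials on which the recommendation was requested
    requested = [t for t in trials if t[0]]

    # Agreement: proportion of requested trials where response matched the recommendation
    agreement = sum(resp == rec for _, resp, rec, _ in requested) / len(requested)
    print(f"agreement on requested trials: {agreement:.2f}")

Splitting `requested` by the recommendation-correct flag before averaging would reproduce the x-axis grouping shown in the figure.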
Fig. 9
Algorithm requests as a function of initial experiences with the algorithm. The x-axis plots the proportion of algorithm mistakes on easier stimuli in the first 20 trials of the experiment; the y-axis plots the subsequent proportion of algorithm requests for the harder stimuli. The diagonal dotted line illustrates the expected relationship that initially seeing more algorithm mistakes would result in less reliance on the algorithm. Coloured horizontal lines show the line of best fit for each experiment. Proportion is plotted on the x-axis to account for the different number of trials across participants
