. 2021 Dec 17;12(1):7337.

doi: 10.1038/s41467-021-27618-5.

Sources of confidence in value-based choice

Jeroen Brus^{1

2}, Helena Aebersold³, Marcus Grueschow⁴, Rafael Polania^{5

6}

Affiliations

¹ Decision Neuroscience Lab, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland. jeroen.brus@hest.ethz.ch.
² Neuroscience Center Zurich, Zurich, Switzerland. jeroen.brus@hest.ethz.ch.
³ Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland.
⁴ Zurich Center for Neuroeconomics (ZNE), Department of Economics, University of Zurich, Zurich, Switzerland.
⁵ Decision Neuroscience Lab, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland. rafael.polania@hest.ethz.ch.
⁶ Neuroscience Center Zurich, Zurich, Switzerland. rafael.polania@hest.ethz.ch.

PMID: 34921144
PMCID: PMC8683513
DOI: 10.1038/s41467-021-27618-5

Sources of confidence in value-based choice

Jeroen Brus et al. Nat Commun. 2021.

. 2021 Dec 17;12(1):7337.

doi: 10.1038/s41467-021-27618-5.

Authors

Jeroen Brus^{1

2}, Helena Aebersold³, Marcus Grueschow⁴, Rafael Polania^{5

6}

Affiliations

¹ Decision Neuroscience Lab, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland. jeroen.brus@hest.ethz.ch.
² Neuroscience Center Zurich, Zurich, Switzerland. jeroen.brus@hest.ethz.ch.
³ Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland.
⁴ Zurich Center for Neuroeconomics (ZNE), Department of Economics, University of Zurich, Zurich, Switzerland.
⁵ Decision Neuroscience Lab, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland. rafael.polania@hest.ethz.ch.
⁶ Neuroscience Center Zurich, Zurich, Switzerland. rafael.polania@hest.ethz.ch.

PMID: 34921144
PMCID: PMC8683513
DOI: 10.1038/s41467-021-27618-5

Abstract

Confidence, the subjective estimate of decision quality, is a cognitive process necessary for learning from mistakes and guiding future actions. The origins of confidence judgments resulting from economic decisions remain unclear. We devise a task and computational framework that allowed us to formally tease apart the impact of various sources of confidence in value-based decisions, such as uncertainty emerging from encoding and decoding operations, as well as the interplay between gaze-shift dynamics and attentional effort. In line with canonical decision theories, trial-to-trial fluctuations in the precision of value encoding impact economic choice consistency. However, this uncertainty has no influence on confidence reports. Instead, confidence is associated with endogenous attentional effort towards choice alternatives and down-stream noise in the comparison process. These findings provide an explanation for confidence (miss)attributions in value-guided behaviour, suggesting mechanistic influences of endogenous attentional states for guiding decisions and metacognitive awareness of choice certainty.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

**Fig. 1. Experiment and regression analysis of choice consistency and confidence.**
a Example display of the rating and choice task. Participants rated their desirability to eat the displayed food item. Next participants were asked to indicate which of the two food items they preferred to consume. After their choice, participants were asked how confident they were about their decision. b Standardized estimates of a multiple logistic regression on choice consistency (green) show that higher value difference (VD) leads to more consistent choices (β = 0.28 ± 0.04, P < 0.001). Higher variability (Var) in the rating of the two alternatives leads to less consistent choices (β = −0.15 ± 0.05, P = 0.002). The total value (TV) of the two items had no reliable influence on choice consistency (β = 8.7 × 10⁻³ ± 0.05, P = 0.43). Standardized estimates of a multiple linear regression on confidence reports show that higher VD lead to more confidence (β = 0.05 ± 0.01, P < 0.001). Crucially, higher variability in the rating of the two alternatives does not have a reliable effect on confidence (β = 0.01 ± 0.02, P = 0.2). Higher TV increases confidence (β = 0.28 ± 0.06, P < 0.001). Error bars indicate the mean standard deviation of the posterior estimates. c The difference of the effect size of the influence of variability on choice consistency and confidence is significant with P < 0.01. Vertical red dashed line indicates the median, black vertical lines indicate the 95% highest density interval. d Participant’s average level of variability in the rating task had a negative influence on average choice consistency of that participant (β = −0.58 ± 0.16, P < 0.001, r = −0.53), however, this effect is not present for the same analyses performed on confidence reports (β = −0.19 ± 0.18, P = 0.15, r = −0.22), P-values are based on the highest density interval of the posterior estimates. Gray shaded areas indicate the 95% confidence bands. The difference of the effect of average variability on choice consistency and confidence ratings is significant (Δβ_cons−conf = −0.39 ± 0.24, P = 0.05). e Confidence as a function of absolute value difference shows the qualitative signatures of confidence reports guided by its statistical definition. Confidence as a function reaction time shows signatures reported in previous work. Confidence as a function of total value confirms the quantitative results presented in panel (b). Data are presented as mean values ± SEM. For the whole figure n = 33 independent participants. Source data are provided as a Source Data file.

**Fig. 2. Generative modeling of confidence: heuristic and normative.**
a Illustration of how confidence reports are generated by the decision-maker according to the heuristic process: confidence is simply computed as the difference between the decision bound and the evidence of the losing accumulator at the time of decision. b Confidence can be computed via the estimation of the expected evidence of the losing accumulator. c How the observer generates confidence reports according to the normative model: confidence is generated by computing the probability that the decision is correct given the decision time and the process model parameters. d Confidence predictions generated by the normative model as a function of RTs, evidence of the loser accumulator, and attentional effort. e Linear regression analysis of confidence comparing the heuristic (blue) versus the normative (orange) model. The results originate from two separate linear regressions, on the left Confidence ~ Correct (Cor) + VD + TV + VD*Cor + TV*Cor and on the right Confidence ~ Cor + θ + RT + θ*Cor + θ*Cor*RT. We use two linear regressions to prevent problems with high correlations between explanatory variables and to separate the input variables from the variables that are generated by the decision-maker. Bars indicate the mean of the standardized β values and error bars the standard error, stars indicate significant difference from 0 with α = 0.05. Statistics are calculated using n = 33 independent participants. Only the normative model predicts that confidence should be higher for higher values of attentional effort.

**Fig. 3. Joint modeling, the covariance approach.**
a Graphical diagram for the joint model with the covariance approach. White circular nodes represent latent variables, grey rectangular nodes represent observable variables. The variables θ_i, k_i, and choices and reaction times feed into the generative models of confidence. b Folke et al. performed similar experiments, with the key difference that the rating task is not repeated. Furthermore, subjects were asked how much they were willing to pay for a certain food item using a standard incentive-compatible Becker–DeGroot–Marschak method. c Results from the variable normative model and model comparison. Confidence is positively related to trial-to-trial fluctuations of attentional effort and evidence gain, shown for three example subjects (for all subjects see Supplementary Figs. 7 and 8). The estimated density of correlation parameters for confidence and attentional effort and for confidence and the evidence gain is bigger than zero. In both cases ρ_mcmc > 0 with P < 0.001. The vertical red dashed line indicates the median, black lines indicate the 95% confidence interval. Loo model comparison of the fixed heuristic (FH), variable heuristic (VH), fixed normative (FN), and variable normative (VN) model versions show that the VN model explains the data best. The Bayes Factor (BF) is calculated between the variable normative model and all other models, for all comparisons we find an infinite BF in favor of the variable normative model. d The same as (c), but for the Folke data. e The empirically found confidence levels as a function of (from left to right) value difference, reaction time, total value, attentional effort, and the evidence gain, split for consistent and inconsistent choice. Data are presented as mean values ± SEM. g The same as in e, but for the predictions of the variable normative model. f, h The same as in (d) and (e), but for the data of Folke et al. For the Brus et al. dataset n = 33 independent participants, for Folke et al. n = 28 independent participants. Source data are provided as a Source Data file.

**Fig. 4. The RUM decision model.**
a Sketch of the simple RUM decision model, color-coded to match the graphs. Observers infer the value of the food items by looking back and forth between choice alternatives. The subsequent comparison process is noisy. We investigate how confidence ratings influence trial-to-trial fluctuations of attentional factors and the evidence gain. b, d Comparison of parameter estimates of two alternative RUMs: a RUM with agent-specific estimates of k and θ and a RUM that allows for trial-to-trial fluctuations of k and θ. b) the median of the posterior estimate of θ of the agent-specific RUM is indicated by the horizontal blue line, the shaded grey area indicates the 95% confidence interval. The diagonal blue lines represent 100 random samples of the posterior distribution of how θ changes with confidence in the RUM allowing for trial-to-trial fluctuations. Remarkably, θ changes over its full range as a function of confidence. d) the same as b, but for k. f Left: standardized posterior estimates of the relationship between confidence and k and θ. Error bars indicate the mean posterior estimate of the standard deviation. Both β^k and β^θ are significantly bigger than zero with P < 0.001. Right: effect sizes of the results shown on the left. Error bars indicate the standard deviation of the posterior estimates of the mean of the effect size. Both the effect sizes of β^k and β^θ are significantly bigger than zero with P < 0.001. P-values are based on the highest density interval of the posterior estimates. h Left column: the empirical probabilities of choosing the upper item; up: as a function of value difference; down: as a function of the difference in dwell time. Right column: the same as left but for the predicted probabilities of choosing the upper item by the simple RUM. The trials are median split in high/low confidence. Value difference and dwell time difference are split into eight groups of equal size. Data are presented as mean values ± SEM. c, e, g, i) Same as b, d, f, h, but for the data of Folke et al. For the Brus et al. dataset n = 33 independent participants, for Folke et al. n = 28 independent participants. Source data are provided as a Source Data file.

**Fig. 5. The efficient coding model.**
a The decision process with three distinct process stages, color-coded to match the graphs. The prior matches the distribution of subjective values v of supermarket products. When choosing between two items, subjects look repeatedly at them, spending unequal time on the two options. The subjective values are internally encoded, the corresponding likelihood function $p (\hat{v} ∣ v)$ is constrained by the prior p(v) via efficient coding. Lastly, noise that occurs after the decoding is taken into account. b Standardized posterior estimates of the relationship between confidence and variance in the encoding process ( $β^{σ_{enc}}$ ), the variance in the comparison process ( $β^{σ_{comp}}$ ), and attentional factors (β^θ). ( $β^{σ_{enc}}$ ) is not significantly different from 0 (P = 0.39), both ( $β^{σ_{comp}}$ ) and (β^θ) are significantly bigger than zero with P < 0.001. The effect size of ( $β^{σ_{enc}}$ ) is not significantly different from 0. Both the effect sizes of ( $β^{σ_{comp}}$ ) and (β^θ) are significantly bigger than zero with P < 0.001. Error bars indicate the mean posterior estimate of the standard deviation. P-values are based on the highest density interval of the posterior estimates. c Left column: the empirical probabilities of choosing the upper item; up: as a function of value difference; down: as a function of the difference in dwell time. Right column: the same as left but for the predicted probabilities by the efficient coding model. The trials are median split in high/low confidence. Value difference and dwell time difference are split into eight groups of equal size. Data are presented as mean values ± SEM. Source data are provided as a Source Data file. d Comparison of parameter estimates of two alternative efficient coding models: a model with agent-specific estimates of σ_enc and a model that allows for trial-to-trial fluctuations of σ_enc. The median of the posterior estimate of σ_enc of the agent-specific model is indicated by the horizontal green line, the shaded grey area indicates the 95% confidence interval. The diagonal green lines represent 100 random samples of the posterior distribution of how σ_enc changes with confidence in the model allowing for trial-to-trial fluctuations. e, f Same as (e) but for σ_comp and θ. g Comparison of the effect sizes of the posterior estimates of σ_enc and σ_comp. Vertical red dashed line indicates the median, black lines indicate the 95% confidence interval. h Comparison of the posterior estimates of the intercept of θ in the efficient coding model and the RUM. i Comparison of the posterior estimates of the slope of θ in the efficient coding model and the RUM. For the whole figure n = 33 independent participants.

See this image and copyright information in PMC

Cited by

A population representation of the confidence in a decision in the parietal cortex.
Zylberberg A, Shadlen MN. Zylberberg A, et al. Cell Rep. 2025 Apr 22;44(4):115526. doi: 10.1016/j.celrep.2025.115526. Epub 2025 Apr 11. Cell Rep. 2025. PMID: 40208792 Free PMC article.
The asymmetric impact of decision-making confidence on regret and relief.
Liu Z. Liu Z. Front Psychol. 2024 Apr 8;15:1365743. doi: 10.3389/fpsyg.2024.1365743. eCollection 2024. Front Psychol. 2024. PMID: 38650908 Free PMC article.
Limits of Optimization.
Carissimo C, Korecki M. Carissimo C, et al. Minds Mach (Dordr). 2024;34(Suppl 1):117-137. doi: 10.1007/s11023-023-09633-1. Epub 2023 Apr 6. Minds Mach (Dordr). 2024. PMID: 38510203 Free PMC article.
Multifaceted confidence in exploratory choice.
Solopchuk O, Dayan P. Solopchuk O, et al. PLoS One. 2025 Jan 9;20(1):e0304923. doi: 10.1371/journal.pone.0304923. eCollection 2025. PLoS One. 2025. PMID: 39787073 Free PMC article.
A Population Representation of the Confidence in a Decision in the Parietal Cortex.
Zylberberg A, Shadlen MN. Zylberberg A, et al. bioRxiv [Preprint]. 2025 Feb 1:2024.08.15.608159. doi: 10.1101/2024.08.15.608159. bioRxiv. 2025. Update in: Cell Rep. 2025 Apr 22;44(4):115526. doi: 10.1016/j.celrep.2025.115526. PMID: 39229195 Free PMC article. Updated. Preprint.

See all "Cited by" articles

References

1. Bang D, et al. Confidence matching in group decision-making. Nat. Hum. Behav. 2017;1:1–7.
1. M. G. Edelson, R. Polania, C. C. Ruff, E. Fehr, and T. A. Hare. Computational and neurobiological foundations of leadership decisions. Science10.1126/science.aat0036 (2018). - PubMed
1. Camerer C, Lovallo D. Overconfidence and excess entry: an experimental approach. Am. Econ. Rev. 1999;89:306–318.
1. Folke T, Jacobsen C, Fleming SM, De Martino B. Explicit representation of confidence informs future value-based decisions. Nat. Hum. Behav. 2017;1:17–19.
1. Kiani R, Corthell L, Shadlen MN. Choice certainty is informed by both evidence and decision time. Neuron. 2014;84:1329–1342. - PMC - PubMed

Publication types

Actions

Associated data

figshare/10.6084/m9.figshare.3756144.v2

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Sources of confidence in value-based choice

Affiliations

Sources of confidence in value-based choice

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

Associated data

LinkOut - more resources

Full Text Sources