Higher-Order Conditioning With Simultaneous and Backward Conditioned Stimulus: Implications for Models of Pavlovian Conditioning

Arthur Prével¹, Ruth M Krebs¹

Affiliations

PMID: 34858147
PMCID: PMC8632485
DOI: 10.3389/fnbeh.2021.749517

Review

Higher-Order Conditioning With Simultaneous and Backward Conditioned Stimulus: Implications for Models of Pavlovian Conditioning

Arthur Prével et al. Front Behav Neurosci. 2021.

. 2021 Nov 11:15:749517.

doi: 10.3389/fnbeh.2021.749517. eCollection 2021.

Authors

Arthur Prével¹, Ruth M Krebs¹

Affiliation

¹ Department of Experimental Psychology, Ghent University, Ghent, Belgium.

PMID: 34858147
PMCID: PMC8632485
DOI: 10.3389/fnbeh.2021.749517

Abstract

In a new environment, humans and animals can detect and learn that cues predict meaningful outcomes, and use this information to adapt their responses. This process is termed Pavlovian conditioning. Pavlovian conditioning is also observed for stimuli that predict outcome-associated cues; a second type of conditioning is termed higher-order Pavlovian conditioning. In this review, we will focus on higher-order conditioning studies with simultaneous and backward conditioned stimuli. We will examine how the results from these experiments pose a challenge to models of Pavlovian conditioning like the Temporal Difference (TD) models, in which learning is mainly driven by reward prediction errors. Contrasting with this view, the results suggest that humans and animals can form complex representations of the (temporal) structure of the task, and use this information to guide behavior, which seems consistent with model-based reinforcement learning. Future investigations involving these procedures could result in important new insights on the mechanisms that underlie Pavlovian conditioning.

Keywords: backward conditioning; higher-order conditioning; reinforcement learning; reward prediction error; simultaneous conditioning.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**Figure 1**
Illustration of the second-order conditioning procedure. **(A)** Phase 1: First-order conditioning between a stimulus (CS1—sound) paired with an unconditioned stimulus (US—water). **(B)** Phase 2: Second-order conditioning between a second stimulus (CS2—light) paired with the previously paired stimulus CS1. **(C)** Classic results found in the second-order conditioning task with the conditioned response (CR) evoked both by CS1 and CS2. In sensory preconditioning, the procedure is similar except that phases 1 and 2 are inversed. **(D)** TD learning for the first-order conditioning phase with change in CS1’s predicted value V_CS1. Note that V_US is zero because of the absence of predicted value at the time of the US. Because R_US is positive, the pairing between CS1 and the US results in a positive δ (i.e., R_US − V_CS1 > 0), and the acquisition of predicted value from CS1 through the update of V_CS1 (*V_CS1*_(new) = *V_CS1*_(old) + *α*δ*). **(E)** TD learning for the second-order conditioning phase with change in CS2’s predicted value *V_CS2*. Note that *R_CS1* is zero because of the absence of reward at CS1. Here, the positive *V_CS1* learned during the first-order conditioning phase is sufficient to produce a positive δ (i.e., γ V_CS1 − V_CS2 > 0) and to increase the predicted value from CS2 (*V_CS2*). TD, Temporal Difference.

**Figure 2**
Illustration of simultaneous, backward, and second-order conditioning with simultaneous CS1. **(A)** Simultaneous conditioning with a stimulus (CS1—sound) presented simultaneously with an unconditioned stimulus (US—water). **(B)** Backward conditioning with CS1 presented after a US. **(C)** Second-order conditioning with simultaneous CS1: In phase 1, a stimulus CS1 is presented simultaneously with a US. In phase 2, a second stimulus CS2 is paired with CS1 through forward pairing. During the test, while CS1 will evoke low conditioned response (CR), CS2 will evoke substantial CR. According to the TD account, a low CR evoked by CS1 is expected because CS1 is not followed by the US (i.e., R_US = 0) in phase 1. In addition, a change in CS2 value (V_CS2) depends directly on CS1’s own value (V_CS1). Thus, a second-order pairing with a first-order stimulus CS1 that evokes a low CR level (and with presumably a low predicted value) should result in low responding to CS2. The evidence of substantial response to that stimulus challenges the TD account. The same holds for a model-based account of higher-order learning if the change in V_CS2 depends on CS1’s own predicted value V_CS1. Instead, it seems necessary for CS2’s predicted value to be based on US expectations to account for this finding. Note that the same pattern of results is observed for second-order conditioning with backward CS1, and for sensory preconditioning with simultaneous and backward CS1.

See this image and copyright information in PMC

References

1. Arcediano F., Escobar M., Miller R. R. (2003). Temporal integration and temporal backward associations in human and nonhuman subjects. Learn. Behav. 31, 242–256. 10.3758/bf03195986 - DOI - PubMed
1. Arcediano F., Escobar M., Miller R. R. (2005). Bidirectional associations in humans and rats. J. Exp. Psychol. Anim. Behav. Process. 31, 301–318. 10.1037/0097-7403.31.3.301 - DOI - PubMed
1. Arcediano F., Miller R. R. (2002). Some constraints for models of timing: a temporal coding hypothesis perspective. Learn. Motiv. 33, 105–123. 10.1006/lmot.2001.1102 - DOI
1. Barnet R. C., Arnold H. M., Miller R. R. (1991). Simultaneous conditioning demonstrated in second-order conditioning: evidence for similar associative structure in forward and simultaneous conditioning. Learn. Motiv. 22, 253–268. 10.1016/0023-9690(91)90008-V - DOI
1. Barnet R. C., Cole R. P., Miller R. R. (1997). Temporal integration in second-order conditioning and sensory preconditioning. Anim. Learn. Behav. 25, 221–233. 10.3758/BF03199061 - DOI

Publication types

Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Higher-Order Conditioning With Simultaneous and Backward Conditioned Stimulus: Implications for Models of Pavlovian Conditioning

Affiliation

Higher-Order Conditioning With Simultaneous and Backward Conditioned Stimulus: Implications for Models of Pavlovian Conditioning

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

References

Publication types

LinkOut - more resources

Full Text Sources