. 2020 Nov 4;16(11):e1008304.

doi: 10.1371/journal.pcbi.1008304. eCollection 2020 Nov.

PPM-Decay: A computational model of auditory prediction with memory decay

Peter M C Harrison^{1

2}, Roberta Bianco³, Maria Chait³, Marcus T Pearce^{2

4}

Affiliations

¹ Computational Auditory Perception Research Group, Max Planck Institute for Empirical Aesthetics, Frankfurt, Germany.
² Cognitive Science Research Group, Queen Mary University of London, London, UK.
³ UCL Ear Institute, University College London, London, UK.
⁴ Department of Clinical Medicine, Aarhus University, Aarhus, Denmark.

PMID: 33147209
PMCID: PMC7668605
DOI: 10.1371/journal.pcbi.1008304

PPM-Decay: A computational model of auditory prediction with memory decay

Peter M C Harrison et al. PLoS Comput Biol. 2020.

. 2020 Nov 4;16(11):e1008304.

doi: 10.1371/journal.pcbi.1008304. eCollection 2020 Nov.

Authors

Peter M C Harrison^{1

2}, Roberta Bianco³, Maria Chait³, Marcus T Pearce^{2

4}

Affiliations

¹ Computational Auditory Perception Research Group, Max Planck Institute for Empirical Aesthetics, Frankfurt, Germany.
² Cognitive Science Research Group, Queen Mary University of London, London, UK.
³ UCL Ear Institute, University College London, London, UK.
⁴ Department of Clinical Medicine, Aarhus University, Aarhus, Denmark.

PMID: 33147209
PMCID: PMC7668605
DOI: 10.1371/journal.pcbi.1008304

Erratum in

Correction: PPM-Decay: A computational model of auditory prediction with memory decay.
Harrison PMC, Bianco R, Chait M, Pearce MT. Harrison PMC, et al. PLoS Comput Biol. 2021 May 26;17(5):e1008995. doi: 10.1371/journal.pcbi.1008995. eCollection 2021 May. PLoS Comput Biol. 2021. PMID: 34038404 Free PMC article.

Abstract

Statistical learning and probabilistic prediction are fundamental processes in auditory cognition. A prominent computational model of these processes is Prediction by Partial Matching (PPM), a variable-order Markov model that learns by internalizing n-grams from training sequences. However, PPM has limitations as a cognitive model: in particular, it has a perfect memory that weights all historic observations equally, which is inconsistent with memory capacity constraints and recency effects observed in human cognition. We address these limitations with PPM-Decay, a new variant of PPM that introduces a customizable memory decay kernel. In three studies-one with artificially generated sequences, one with chord sequences from Western music, and one with new behavioral data from an auditory pattern detection experiment-we show how this decay kernel improves the model's predictive performance for sequences whose underlying statistics change over time, and enables the model to capture effects of memory constraints on auditory pattern detection. The resulting model is available in our new open-source R package, ppm (https://github.com/pmcharrison/ppm).

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. A simple decay kernel.**
The kernel is defined by an initial weight of w₀ = 1, an exponential decay with half life t_0.5 = 1 s, and an asymptotic weight w_∞ = 0.2.

**Fig 2. Illustrative plots for Experiment 1.**
A) Example sequence-generation models as randomly generated in Experiment 1. The bar plots describe 0th-order symbol distributions, whereas the matrices describe 1st-order transition probabilities. B) Repeated-measures plot indicating how predictive accuracy for individual sequences (N = 500, hollow circles) increases after the introduction of an exponential-decay kernel. C) Absolute changes in predictive accuracy for individual sequences, as summarised by a kernel density estimator. The median accuracy change is marked with a solid vertical line.

**Fig 3. Sample chord sequences analyzed in Experiment 2.**
A) represents the popular music corpus (‘Night Moves’, by Bob Seger), B) represents the jazz corpus (‘Thanks for the Memory’, by Leo Robin), and C) represents the Bach chorale harmonization corpus (‘Mit Fried und Freud ich fahr dahin’, by J. S. Bach). Each chord is labeled by its integer encoding within the chord alphabet for the respective corpus. Each chord sequence corresponds to the first eight chords of the first composition in the downsampled corpus. Each chord is defined by a combination of a bass pitch class (lower stave) and a collection of non-bass pitch classes (upper stave). For visualization purposes, bass pitch classes are assigned to the octave below middle C, and non-bass pitch classes to the octave above middle C.

**Fig 4. Predictive performances for different decay kernels in Experiment 2.**
Each composition contributed one cross-entropy value for each decay kernel; these cross-entropy values are expressed relative to the cross-entropy values of the original PPM model, and then summarised using kernel density estimators. Median performance improvements are marked with solid vertical lines.

**Fig 5. Example analysis of a single trial in Experiment 3.**
The three panels plot each tone’s frequency, change-point statistic, and information content respectively. ‘Nominal transition’ denotes the point at which the pattern changes from random tones to a repeating pattern of length 10. This repetition starts to become discernible after 10 tones (‘Effective transition’), at which point the sequence becomes fully deterministic. Correspondingly, information content (or ‘surprise’) drops, and triggers change-point detection at ‘Detection of transition’.

**Fig 6. Behavioral results for Experiment 3.**
A) Participant d-prime scores by condition, as summarized by violin plots and Tukey box plots. B) Participant mean response times by condition, as summarized by violin plots and Tukey box plots. C) As B, except benchmarking response times against the 25 ms conditions.

**Fig 7. Decay kernels employed in Experiment 3.**
The temporal duration of the buffer corresponds to the buffer’s informational capacity (15 tones) multiplied by the tone duration.

**Fig 8. Modeling participant data in Experiment 3.**
Participant data (mean response times) are plotted as white circles, whereas different model configurations (mean simulated response times) are plotted as solid bars. Error bars denote 95% confidence intervals computed using the central limit theorem. A) Progressively adding exponential weight decay and retrieval noise to the original PPM model. B) Progressively adding longer buffers to the PPM-Decay model.

**Fig 9. Schematic figure of accumulating observations within a memory buffer.**
Weights for the n-gram “AB” are displayed as a function of time, assuming an itemwise buffer capacity (n_b) of 5, a buffer weight (w₀) of 1.5, an initial post-buffer weight (w₁) of 1, a half life (t_0.5) of 1 second, and an asymptotic post-buffer weight (w_∞) of 0.

**Fig 10. Illustrative weight decay profile.**
This figure plots the weight of an n-gram of length one as a function of relative observer position, assuming that new symbols continue to be presented every 0.05 seconds. Model parameters are set to t_b = 2, n_b = 15, w₀ = 1.0, t_0.5 = 3.5, w₁ = 0.6, and w_∞ = 0, as optimized in Experiment 3.

**Fig 11. Illustration of the interpolated smoothing mechanism.**
This smoothing mechanism blends together maximum-likelihood n-gram models of different orders. Here the Markov order bound is two, the predictive context is “abracadabra”, and the task is to predict the next symbol. Columns are identified by Markov order; rows are organized into weight distributions, maximum-likelihood distributions, and interpolated distributions. Maximum-likelihood distributions are created by normalizing the corresponding weight distributions. Interpolated distributions are created by recursively combining the current maximum-likelihood distribution with the next-lowest-order interpolated distribution. The labelled arrows give the weight of each distribution, as computed using escape method “A”. The “Order = −1” column identifies the termination of the interpolated smoothing, and does not literally mean a Markov order of −1.

See this image and copyright information in PMC

Cited by

Implicit auditory memory in older listeners: From encoding to 6-month retention.
Bianco R, Hall ETR, Pearce MT, Chait M. Bianco R, et al. Curr Res Neurobiol. 2023 Nov 7;5:100115. doi: 10.1016/j.crneur.2023.100115. eCollection 2023. Curr Res Neurobiol. 2023. PMID: 38020808 Free PMC article.
Correction: PPM-Decay: A computational model of auditory prediction with memory decay.
Harrison PMC, Bianco R, Chait M, Pearce MT. Harrison PMC, et al. PLoS Comput Biol. 2021 May 26;17(5):e1008995. doi: 10.1371/journal.pcbi.1008995. eCollection 2021 May. PLoS Comput Biol. 2021. PMID: 34038404 Free PMC article.
Measuring self-similarity in empirical signals to understand musical beat perception.
Lenc T, Lenoir C, Keller PE, Polak R, Mulders D, Nozaradan S. Lenc T, et al. Eur J Neurosci. 2025 Jan;61(2):e16637. doi: 10.1111/ejn.16637. Eur J Neurosci. 2025. PMID: 39853878 Free PMC article.
Humans can find rhythm in randomly timed sounds.
van der Werff J, Tufarelli T, Verga L, Ravignani A. van der Werff J, et al. R Soc Open Sci. 2025 Aug 20;12(8):250453. doi: 10.1098/rsos.250453. eCollection 2025 Aug. R Soc Open Sci. 2025. PMID: 40843187 Free PMC article.
Expectation adaptation for rare cadences in music: Item order matters in repetition priming.
Chander A, Aslin RN. Chander A, et al. Cognition. 2023 Nov;240:105601. doi: 10.1016/j.cognition.2023.105601. Epub 2023 Aug 19. Cognition. 2023. PMID: 37604028 Free PMC article.

See all "Cited by" articles

References

1. Winkler I, Denham SL, Nelken I. Modeling the auditory scene: predictive regularity representations and perceptual objects. Trends in Cognitive Sciences. 2009;13(12):532–40. 10.1016/j.tics.2009.09.003 - DOI - PubMed
1. Wacongne C, Labyt E, Wassenhove V van, Bekinschtein T, Naccache L, Dehaene S. Evidence for a hierarchy of predictions and prediction errors in human cortex. Proceedings of the National Academy of Sciences. 2011;108(51):20754–9. 10.1073/pnas.1117807108 - DOI - PMC - PubMed
1. Barascud N, Pearce MT, Griffiths TD, Friston KJ, Chait M. Brain responses in humans reveal ideal observer-like sensitivity to complex acoustic patterns. Proceedings of the National Academy of Sciences of the United States of America. 2016;113(5):E616–25. 10.1073/pnas.1508523113 - DOI - PMC - PubMed
1. Garrido MI, Sahani M, Dolan RJ. Outlier responses reflect sensitivity to statistical structure in the human brain. PLoS Computational Biology. 2013;9(3). 10.1371/journal.pcbi.1002999 - DOI - PMC - PubMed
1. Agres K, Abdallah S, Pearce MT. Information-theoretic properties of auditory sequences dynamically influence expectation and memory. Cognitive Science. 2018;42:43–76. 10.1111/cogs.12477 - DOI - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

PPM-Decay: A computational model of auditory prediction with memory decay

Affiliations

PPM-Decay: A computational model of auditory prediction with memory decay

Authors

Affiliations

Erratum in

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Medical