A two-stage spectral model for sound texture perception: Synthesis and psychophysics
- PMID: 36845027
- PMCID: PMC9950610
- DOI: 10.1177/20416695231157349
A two-stage spectral model for sound texture perception: Synthesis and psychophysics
Abstract
The natural environment is filled with a variety of auditory events such as wind blowing, water flowing, and fire crackling. It has been suggested that the perception of such textural sounds is based on the statistics of the natural auditory events. Inspired by a recent spectral model for visual texture perception, we propose a model that can describe the perceived sound texture only with the linear spectrum and the energy spectrum. We tested the validity of the model by using synthetic noise sounds that preserve the two-stage amplitude spectra of the original sound. Psychophysical experiment showed that our synthetic noises were perceived as like the original sounds for 120 real-world auditory events. The performance was comparable with the synthetic sounds produced by McDermott-Simoncelli's model which considers various classes of auditory statistics. The results support the notion that the perception of natural sound textures is predictable by the two-stage spectral signals.
Keywords: listening; models; texture; visuo-auditory interactions.
© The Author(s) 2023.
Conflict of interest statement
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Figures
References
-
- Attias H., Schreiner C. (1997). Coding of naturalistic stimuli by auditory midbrain neurons. Advances in Neural Information Processing Systems, 10, 103–109.
-
- Bergen J. R., Landy M. S. (1991). Computational modeling of visual texture segregation. Computational Models of Visual Processing, 17, 253–271.
LinkOut - more resources
Full Text Sources
