Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar;149(3):1434.
doi: 10.1121/10.0003604.

Exponential spectro-temporal modulation generation

Affiliations

Exponential spectro-temporal modulation generation

Trevor A Stavropoulos et al. J Acoust Soc Am. 2021 Mar.

Abstract

Traditionally, real-time generation of spectro-temporally modulated noise has been performed on a linear amplitude scale, partially due to computational constraints. Experiments often require modulation that is sinusoidal on a logarithmic amplitude scale as a result of the many perceptual and physiological measures which scale linearly with exponential changes in the signal magnitude. A method is presented for computing exponential spectro-temporal modulation, showing that it can be expressed analytically as a sum over linearly offset sidebands with component amplitudes equal to the values of the modified Bessel function of the first kind. This approach greatly improves the efficiency and precision of stimulus generation over current methods, facilitating real-time generation for a broad range of carrier and envelope signals.

PubMed Disclaimer

Figures

FIG. 1.
FIG. 1.
(Color online) Spectrographic analysis of sideband-based STM. Multiple comparisons of the proposed sideband-based generation method with a classic numerical solution for a midpoint-to-peak modulation depth of 20 dB (40 dB peak-to-valley). (a) Spectrogram of the STM created with the proposed Bessel function sideband approach using a sideband extent of five (ten sidebands), (b) spectrogram of the STM created by exhaustively evaluating the explicit form for exponential modulation, and (c) ratio of the Bessel function sideband-generated stimulus to the explicit form of the stimulus. Note that the colormap limits are ±6×104 dB for the spectral power ratio.
FIG. 2.
FIG. 2.
(Color online) The ratio between the energy of each term in the partial sum over a limited number of sidebands and the energy of the complete infinite sum, expressed in decibels and shown for several midpoint-to-peak modulation depths. Because sidebands are distributed symmetrically about the carrier tone, they are counted in terms of “sideband extent,” which is half of the total number of sidebands. The visualized “power ratio” value can be interpreted as the energetic contribution of the omitted terms relative to the entire sum. The values converge to zero quickly enough such that few total datapoints are visible when visualized with a linear ordinate axis.
FIG. 3.
FIG. 3.
(Color online) Metrics of spectral envelope fluctuations in SM compared among three generation methods: black open circles for the explicit evaluation, red open squares for the existing method, and green filled triangles for the proposed sideband method. The calculations were performed on 100 exemplars across each of 20 modulation depths for each stimulus generation method. (a) The fourth moment of the spectrum (ordinate) as a function of the SM depth (abscissa). Error bars indicate the standard deviation across the 100 exemplars. (b) The crest factor of the spectral envelope (ordinate) as a function of the SM depth (abscissa). Error bars indicate the standard deviation across 100 samples.

References

    1. Chi T., Gao Y., Guyton M. C., Ru P., and Shamma S., “ Spectro-temporal modulation transfer functions and speech intelligibility,” J. Acoust. Soc. Am. 106(5), 2719–2732 (1999).10.1121/1.428100 - DOI - PubMed
    1. Elhilali M., Chi T., and Shamma S. A., “ A spectro-temporal modulation index (STMI) for assessment of speech intelligibility,” Speech Commun. 41(2), 331–348 (2003).10.1016/S0167-6393(02)00134-6 - DOI
    1. van Veen T. M. and Houtgast T., “ Spectral sharpness and vowel dissimilarity,” J. Acoust. Soc. Am. 77(2), 628–634 (1985).10.1121/1.391880 - DOI - PubMed
    1. Goossens T., van de Par S., and Kohlrausch A., “ On the ability to discriminate Gaussian-noise tokens or random tone-burst complexes,” J. Acoust. Soc. Am. 124(4), 2251–2262 (2008).10.1121/1.2973184 - DOI - PubMed
    1. Weber E. H., De Subtilitate Tactus (The Sense of Touch) ( Academic, London, 1978).

Publication types