Perceptual interaction of the harmonic source and noise in voice

Jody Kreiman¹, Bruce R Gerratt

Affiliations

PMID: 22280610
PMCID: PMC3283904
DOI: 10.1121/1.3665997

Perceptual interaction of the harmonic source and noise in voice

Jody Kreiman et al. J Acoust Soc Am. 2012 Jan.

. 2012 Jan;131(1):492-500.

doi: 10.1121/1.3665997.

Authors

Jody Kreiman¹, Bruce R Gerratt

Affiliation

¹ Division of Head and Neck Surgery, UCLA School of Medicine, 31-24 Rehab Center, Los Angeles, California 90095-1794, USA. jkreiman@ucla.edu

PMID: 22280610
PMCID: PMC3283904
DOI: 10.1121/1.3665997

Abstract

Although the amount of inharmonic energy (noise) present in a human voice is an important determinant of vocal quality, little is known about the perceptual interaction between harmonic and inharmonic aspects of the voice source. This paper reports three experiments investigating this issue. Results indicate that perception of the harmonic slope and of noise levels are both influenced by complex interactions between the spectral shape and relative levels of harmonic and noise energy in the voice source. Just-noticeable differences (JNDs) for the noise-to-harmonics ratio (NHR) varied significantly with the NHR and harmonic spectral slope, but NHR had no effect on JNDs for NHR when harmonic slopes were steepest, and harmonic slope had no effect when NHRs were highest. Perception of changes in the harmonic source slope depended on NHR and on the harmonic source slope: JNDs increased when spectra rolled off steeply, with this effect in turn depending on NHR. Finally, all effects were modulated by the shape of the noise spectrum. It thus appears that, beyond masking, understanding perception of individual parameters requires knowledge of the acoustic context in which they function, consistent with the view that voices are integral patterns that resist decomposition.

PubMed Disclaimer

Figures

**Figure 1**
(Color online) Manipulations of the harmonic voice source spectrum. Listeners adjust the slope of H2–Hn by typing the desired slope value into a box (not shown) and then clicking the point labeled with a double arrow. Note that H1–H2 remains constant throughout manipulations of H2–Hn.

**Figure 2**
Variations in sensitivity to changes in NHR and H2–Hn as a function of baseline H2–Hn. The y axis shows the ratio of the JND to range for the NHR (an index of listeners’ overall sensitivity), and the x axis shows baseline H2–Hn. (a) Sensitivity to changes in NHR. (b) Sensitivity to changes in H2–Hn. Values for NHR = −40 dB (noise free) are plotted with filled circles; open squares represent values when the NHR = −30 dB; asterisks show values when the NHR = −20 dB; and filled triangles indicate values when the NHR = −10 dB (very noisy). Ellipses enclose points that do not differ significantly. See text for fuller description.

**Figure 3**
Representative noise spectra. Units on the y axis are arbitrary. (a) A typical falling spectrum. (b) A typical rising spectrum. (c) A typical flat spectrum.

**Figure 4**
(Color online) Changes in discrimination accuracy (measured by d′) for the three noise sources, as a function of changes in (a) NHR and (b) H2–Hn. See text for more discussion.

See this image and copyright information in PMC

References

1. Andics, A., McQueen, J. M., Petersson, K. M., Gál, V., Rudas, G., and Vidnyánszky, Z. (2010). “Neural mechanisms for voice recognition,” Neuroimage 52, 1528–1540.10.1016/j.neuroimage.2010.05.048 - DOI - PubMed
1. Brockmann, M., Storck, C., Carding, P. N., and Drinnan, M. J. (2008). “Voice loudness and gender effects on jitter and shimmer in healthy adults,” J. Speech Lang. Hear. Res. 51, 1152–1160.10.1044/1092-4388(2008/06-0208) - DOI - PubMed
1. Buder, E. H. (2000). “Acoustic analysis of voice quality: A tabulation of algorithms 1902–1990,” in Voice Quality Measurement, edited by Kent R. D. (Singular, San Diego, CA: ), pp. 119–244.
1. de Krom, G. (1993). “A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals,” J. Speech Hear. Res. 36, 254–266. - PubMed
1. de Krom, G. (1995). “Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments,” J. Speech Hear. Res. 38, 794–811. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Perceptual interaction of the harmonic source and noise in voice

Affiliation

Perceptual interaction of the harmonic source and noise in voice

Authors

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical