Sensitivity of Acoustic Voice Quality Measures in Simulated Reverberation Conditions
- PMID: 39768071
- PMCID: PMC11673399
- DOI: 10.3390/bioengineering11121253
Sensitivity of Acoustic Voice Quality Measures in Simulated Reverberation Conditions
Abstract
Room reverberation can affect oral/aural communication and is especially critical in computer analysis of voice. High levels of reverberation can distort voice recordings, impacting the accuracy of quantifying voice production quality and vocal health evaluations. This study quantifies the impact of additive simulated reverberation on otherwise clean voice recordings as reflected in voice metrics commonly used for voice quality evaluation. From a larger database of voice recordings collected in a low-noise, low-reverberation environment, voice samples of a sustained [a:] vowel produced at two different speaker intents (comfortable and clear) by five healthy voice college-age female native English speakers were used. Using the reverb effect in Audacity, eight reverberation situations indicating a range of reverberation times (T20 between 0.004 and 1.82 s) were simulated and convolved with the original recordings. All voice samples, both original and reverberation-affected, were analyzed using freely available PRAAT software (version 6.0.13) to calculate five common voice parameters: jitter, shimmer, harmonic-to-noise ratio (HNR), alpha ratio, and smoothed cepstral peak prominence (CPPs). Statistical analyses assessed the sensitivity and variations in voice metrics to a range of simulated room reverberation conditions. Results showed that jitter, HNR, and alpha ratio were stable at simulated reverberation times below T20 of 1 s, with HNR and jitter more stable in the clear vocal style. Shimmer was highly sensitive even at T20 of 0.53 s, which would reflect a common room, while CPPs remained stable across all simulated reverberation conditions. Understanding the sensitivity and stability of these voice metrics to a range of room acoustics effects allows for targeted use of certain metrics even in less controlled environments, enabling selective application of stable measures like CPPs and cautious interpretation of shimmer, ensuring more reliable and accurate voice assessments.
Keywords: reverberation; sensitivity; simulated room acoustics; speech acoustics; voice metrics.
Conflict of interest statement
The authors confirm that there are no conflicts of interest regarding the work introduced in the present paper.
Figures






Similar articles
-
An Assessment of Different Praat Versions for Acoustic Measures Analyzed Automatically by VoiceEvalU8 and Manually by Two Raters.J Voice. 2023 Jan;37(1):17-25. doi: 10.1016/j.jvoice.2020.12.003. Epub 2020 Dec 29. J Voice. 2023. PMID: 33384248 Free PMC article.
-
Evaluation of Acoustic Analyses of Voice in Nonoptimized Conditions.J Speech Lang Hear Res. 2020 Dec 14;63(12):3991-3999. doi: 10.1044/2020_JSLHR-20-00212. Epub 2020 Nov 13. J Speech Lang Hear Res. 2020. PMID: 33186510
-
Acoustic Effects of Vocal Warm-Up: A 7-Week Longitudinal Case Study.J Voice. 2024 Mar;38(2):458-465. doi: 10.1016/j.jvoice.2021.09.030. Epub 2021 Nov 26. J Voice. 2024. PMID: 34844825 Free PMC article.
-
Worldwide Healthy Adult Voice Baseline Parameters: A Comprehensive Review.J Voice. 2022 Sep;36(5):637-649. doi: 10.1016/j.jvoice.2020.08.028. Epub 2020 Oct 8. J Voice. 2022. PMID: 33039203 Review.
-
The Rapidly Evolving Scenario of Acoustic Voice Analysis in Otolaryngology.Cureus. 2024 Nov 11;16(11):e73491. doi: 10.7759/cureus.73491. eCollection 2024 Nov. Cureus. 2024. PMID: 39669823 Free PMC article. Review.
References
-
- Yousef A.M. Ph.D. Dissertation. Michigan State University; East Lansing, MI, USA: 2023. Laryngeal Mechanisms and Vocal Folds Function in Adductor Laryngeal Dystonia During Connected Speech.
Grants and funding
LinkOut - more resources
Full Text Sources