Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Nov 11;16(11):e73491.
doi: 10.7759/cureus.73491. eCollection 2024 Nov.

The Rapidly Evolving Scenario of Acoustic Voice Analysis in Otolaryngology

Affiliations
Review

The Rapidly Evolving Scenario of Acoustic Voice Analysis in Otolaryngology

Marco Fantini et al. Cureus. .

Abstract

The field of voice analysis has experienced significant transformations, evolving from basic perceptual assessments to the incorporation of advanced digital signal processing and computational tools. This progression has facilitated a deeper understanding of the complex dynamics of vocal function, particularly through the use of acoustic voice analysis within a multidimensional evaluation framework. Traditionally, voice analysis relied on parameters such as fundamental frequency, jitter, shimmer, and noise-to-harmonic ratio, which, despite their utility, have faced criticism for variability and lack of robustness. Recent developments have led to a shift toward more reliable metrics such as cepstral measures, which offer improved accuracy in voice quality assessments. Furthermore, the integration of multiparametric constructs underscores a comprehensive approach to evaluating vocal quality, blending sustained vowels, and continuous speech analyses. Current trends in clinical practice increasingly favor these advanced measures over traditional parameters due to their greater reliability and clinical utility. Additionally, the emergence of artificial intelligence (AI), particularly deep learning, holds promise for revolutionizing voice analysis by enhancing diagnostic precision and enabling efficient, non-invasive screening methods. This shift toward AI-driven approaches signifies a potential paradigm change in voice health, suggesting a future where AI not only aids in diagnosis but also the early detection and treatment of voice-related pathologies.

Keywords: acoustic analysis; acoustic voice quality index; artificial intelligence; vocal quality; voice.

PubMed Disclaimer

Conflict of interest statement

Conflicts of interest: In compliance with the ICMJE uniform disclosure form, all authors declare the following: Payment/services info: All authors have declared that no financial support was received from any organization for the submitted work. Financial relationships: All authors have declared that they have no financial relationships at present or within the previous three years with any organizations that might have an interest in the submitted work. Other relationships: All authors have declared that there are no other relationships or activities that could appear to have influenced the submitted work.

Figures

Figure 1
Figure 1. Graphic representation of a cepstrum.
This figure was obtained using the free software PRAAT (www.fon.hum.uva.nl/praat/) by the author Marco Fantini.
Figure 2
Figure 2. Example of the Acoustic Voice Quality Index (AVQI), version 03.01, with the Italian language cut-off 2.35.
This figure was obtained using the free software PRAAT (www.fon.hum.uva.nl/praat/) by the author Marco Fantini.

References

    1. Hirano M. Clinical Examination of Voice. New York: Springer-Verlag; 1981. Psycho-acoustic evaluation of voice; pp. 81–84.
    1. Perceptual evaluation of dysphonia: reliability and relevance. Dejonckere PH, Obbens C, de Moor GM, Wieneke GH. Folia Phoniatr (Basel) 1993;45:76–83. - PubMed
    1. Titze IR. Iowa City, LA: National Center for Voice and Speech; 1995. Workshop on Acoustic Analysis: Summary Statement.
    1. A theoretical study of F0-F1 interaction with application to resonant speaking and singing voice. Titze IR. J Voice. 2004;18:292–298. - PubMed
    1. The acoustics of the singing voice. Sundberg J. Sci Am. 1977;236:82-4, 86, 88-91. - PubMed

LinkOut - more resources