Uncertainty of Spatial Segmentation of High-Speed Videoendoscopy and Its Temporal and Spatial Dependency
- PMID: 40157854
- PMCID: PMC12353461
- DOI: 10.1016/j.jvoice.2025.03.007
Uncertainty of Spatial Segmentation of High-Speed Videoendoscopy and Its Temporal and Spatial Dependency
Abstract
Objective: Spatial segmentation of high-speed videoendoscopy (HSV) is the process that detects the edges of the vocal folds and represents them in analytic form. The level of spatial segmentation uncertainty (ie, how close vs. far apart different experts marked the edges of the vocal folds) can have a great impact on the level of uncertainty of the final measures (ie, their dispersion). This study quantified the uncertainty of spatial segmentation and investigated its dependency on the phase of the glottal cycle and the location of vocal fold edges along the anterior-posterior direction.
Method: Three experts manually segmented the vocal fold edges of twelve HSV recordings using an iterative process consisting of an initial segmentation followed by a blinded reconciliation phase. Segmentation uncertainty was computed as the distance in pixels between the three-segmented edges at the end of the iterative process. The relationships between segmentation uncertainty and different sections of the glottis along the anterior-posterior direction and the relationships between segmentation uncertainty and different phases of the glottal cycle were quantified.
Results: Segmentation uncertainties of the anterior and the posterior sections of the glottis were significantly higher than the middle section, while uncertainty of the anterior section was the highest and 40% larger than the middle section. The average segmentation uncertainty and normalized glottal area were positively correlated. Segmentation uncertainty of the most open glottal configurations was 31% larger than the most closed glottal configuration.
Conclusion: The uncertainty of spatial segmentation of the vocal fold edges depends on the phase of the glottal cycle and the location of the edge along the anterior-posterior direction; hence, it is expected for different HSV measures to have different levels of uncertainties. The implications of these findings for vocal fold velocity measures are discussed. Additionally, the findings from this study could provide direction for future automated spatial segmentation methods and for creating a robust and reliable automated HSV processing pipeline.
Keywords: Uncertainty—Reliability—Spatial segmentation—Vocal fold edges—Vocal fold velocity..
Copyright © 2025 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References
-
- Titze IR, Lemke J, and Montequin D, “Populations in the US workforce who rely on voice as a primary tool of trade: a preliminary report,” Journal of voice, vol. 11, no. 3, pp. 254–259, 1997. - PubMed
-
- Hogikyan ND and Sethuraman G, “Validation of an instrument to measure voice-related quality of life (V-RQOL),” Journal of voice, vol. 13, no. 4, pp. 557–569, 1999. - PubMed
-
- Verdolini K and Ramig LO, “Review: Occupational risks for voice problems,” Logoped Phoniatr Vocol, vol. 26, no. 1, pp. 37–46, 2001. - PubMed
-
- Cohen SM, Kim J, Roy N, Asche C, and Courey M, “Direct health care costs of laryngeal diseases and disorders,” Laryngoscope, vol. 122, no. 7, pp. 1582–1588, 2012. - PubMed
-
- Cohen SM, Kim J, Roy N, Asche C, and Courey M, “The impact of laryngeal disorders on work-related dysfunction,” Laryngoscope, vol. 122, no. 7, pp. 1589–1594, 2012. - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources
