J Acoust Soc Am. 2019 Aug;146(2):1492.

The effect of high-speed videoendoscopy configuration on reduced-order model parameter estimates by Bayesian inference

Jonathan J Deng¹, Paul J Hadwin¹, Sean D Peterson¹

Affiliation

¹ Department of Mechanical and Mechatronics Engineering, University of Waterloo, Ontario N2L 3G1, Canada.

PMID: 31472542
PMCID: PMC6715443
DOI: 10.1121/1.5124256

Abstract

Bayesian inference has been previously demonstrated as a viable inverse analysis tool for estimating subject-specific reduced-order model parameters and uncertainties. However, previous studies have relied upon simulated glottal area waveforms with superimposed random noise as the measurement. In practice, high-speed videoendoscopy is used to measure glottal area, which introduces practical imaging effects not captured in simulated data, such as viewing angle, frame rate, and camera resolution. Herein, high-speed videos of the vocal folds were approximated by recording the trajectories of physical vocal fold models controlled by a symmetric body-cover model. Twenty videos were recorded, varying subglottal pressure, cricothyroid activation, and viewing angle, with frame rate and video resolution varied by digital video manipulation. Bayesian inference was used to estimate subglottal pressure and cricothyroid activation from glottal area waveforms extracted from the videos. The resulting estimates show off-axis viewing of 10° can lead to a 10% bias in the estimated subglottal pressure. A viewing model is introduced such that viewing angle can be included as an estimated parameter, which alleviates estimate bias. Frame rate and pixel resolution were found to primarily affect uncertainty of parameter estimates up to a limit where spatial and temporal resolutions were too poor to resolve the glottal area. Since many high-speed cameras have the ability to sacrifice spatial for temporal resolution, the findings herein suggest that Bayesian inference studies employing high-speed video should increase temporal resolutions at the expense of spatial resolution for reduced estimate uncertainties.
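The bias mechanism described in the abstract can be illustrated with a toy grid-based Bayesian estimate. This is only a sketch under loud assumptions: `observed_width` is a hypothetical stand-in, not the body-cover model or the paper's viewing model. The square-root amplitude law, the occlusion depth `DEPTH_MM`, the oscillation frequency, the frame rate, and the noise level `sigma` are all invented for illustration. The resulting numbers are not comparable to the paper's results.

```python
import numpy as np

F0_HZ = 120.0     # assumed oscillation frequency (hypothetical)
DEPTH_MM = 0.2    # assumed occluding depth of the medial surface (hypothetical)

def observed_width(p_sub, alpha_deg, t):
    """Toy forward and viewing model; NOT the body-cover model."""
    amplitude = 0.02 * np.sqrt(p_sub)                 # assumed amplitude law, in mm
    alpha = np.radians(alpha_deg)
    projected = amplitude * np.cos(alpha) * np.sin(2 * np.pi * F0_HZ * t)
    occluded = projected - DEPTH_MM * np.sin(alpha)   # off-axis view hides part of the gap
    return np.maximum(occluded, 0.0)                  # width cannot be negative

def log_posterior_grid(measured, t, p_grid, alpha_grid, sigma=0.02):
    """Gaussian log-likelihood on a grid; with flat priors this is the
    log-posterior up to an additive constant."""
    logp = np.empty((p_grid.size, alpha_grid.size))
    for i, p in enumerate(p_grid):
        for j, a in enumerate(alpha_grid):
            resid = measured - observed_width(p, a, t)
            logp[i, j] = -0.5 * np.sum(resid**2) / sigma**2
    return logp

t = np.arange(0.0, 0.05, 1.0 / 4000.0)      # 50 ms at an assumed 4000 frames per second
p_grid = np.arange(1000.0, 2601.0, 20.0)    # candidate subglottal pressures, Pa
alpha_grid = np.arange(0.0, 15.1, 2.5)      # candidate viewing angles, degrees

# Noise-free synthetic "measurement" recorded 10 degrees off axis.
measured = observed_width(1800.0, 10.0, t)

# (1) Viewing angle ignored: the model is forced to assume an on-axis view.
logp_fixed = log_posterior_grid(measured, t, p_grid, np.array([0.0]))
p_biased = p_grid[np.argmax(logp_fixed[:, 0])]

# (2) Viewing angle included as an estimated parameter.
logp_joint = log_posterior_grid(measured, t, p_grid, alpha_grid)
i_best, j_best = np.unravel_index(np.argmax(logp_joint), logp_joint.shape)
p_joint, alpha_joint = p_grid[i_best], alpha_grid[j_best]

print("angle ignored:  P_sub =", p_biased)
print("angle estimated: P_sub =", p_joint, "alpha =", alpha_joint)
```

In this toy setting the on-axis assumption underestimates the pressure, while the joint estimate recovers the values used to generate the data. The joint estimate works here only because the assumed occlusion term changes the waveform shape as well as its amplitude, which makes pressure and angle separately identifiable.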

Figures

**FIG. 1.**
A schematic of the simulated HSV experimental setup. A pair of rigid two-dimensional VF medial surfaces (a coronal cross-section) is driven by a motion system that provides one translational and one rotational degree of freedom, as in the bar-plate VF model (Ref. 7). A consumer DSLR is used to capture the VF motion. A calibration plate with two dots, shown as circles, allows alignment of the camera view along a specified angle α.

**FIG. 2.**
The mapping procedure connects points on the BCM to points on the physical VF geometry. Each physical VF is controlled by two degrees of freedom ($s$ and $\theta$). These are mapped to the cover mass displacements ($x_u$ and $x_l$) of the BCM. The rigid VF dimensions are $L_u = 17.07$ mm, $L_l = 15.11$ mm, $\alpha_u = 27.71^\circ$, $R_l = 22.23$ mm, and $R_u = 14.29$ mm.

**FIG. 3.**
(Color online) (a) Sample image extracted from a sample video recording. (b) The Laplacian computed using a 3 × 3 px kernel over row 100 of the image in (a). The open and closed circles are peaks of the Laplacian.

**FIG. 4.**
(Color online) The time-averaged variance of the glottal width over rows of frames, normalized by the variance at the highest resolution, for various cases of known parameters and angles of view. Each point corresponds to the time-averaged variance computed for one of the known $(P_{sub}, a_{ct})$ cases at each of the angles of view $\alpha = (0^\circ, 2.5^\circ, 5.0^\circ)$.

**FIG. 5.**
(Color online) Sample glottal width waveform extracted from HSV for $(P_{sub}, a_{ct}) = (1800\ \mathrm{Pa}, 0.15)$ in physiological dimensions.

**FIG. 6.**
(Color online) Observed glottal width as a function of camera viewing angle $\alpha$ for $(P_{sub}, a_{ct}) = (1800\ \mathrm{Pa}, 0.15)$. Angles of view correspond to: –––– $0^\circ$, - - - $5^\circ$, -·-· $10^\circ$.

**FIG. 7.**
(Color online) The estimated (a) $P_{sub}$ and (b) $a_{ct}$ with increasing viewing angle. (c) The relative uncertainty with increasing viewing angle. Known reference parameters are: $(P_{sub}, a_{ct}) =$ –––– (1800 Pa, 0.15), - - - (2000 Pa, 0.15), -·-· (1800 Pa, 0.20), …. (2000 Pa, 0.20).

**FIG. 8.**
(Color online) The estimated (a) $P_{sub}$ and (b) $a_{ct}$ with decreasing frame rate. (c) The relative uncertainty with decreasing frame rate. Known reference parameters are: $(P_{sub}, a_{ct}) =$ –––– (1800 Pa, 0.15), - - - (2000 Pa, 0.15), -·-· (1800 Pa, 0.20), …. (2000 Pa, 0.20).

**FIG. 9.**
(Color online) The measured glottal width waveform for various degrees of spatial downsampling for $(P_{sub}, a_{ct}) = (1800\ \mathrm{Pa}, 0.15)$. Each resolution corresponds to a magnification factor of approximately: –––– 0.013 mm px⁻¹, - - - 0.053 mm px⁻¹, -·-· 0.21 mm px⁻¹.

**FIG. 10.**
(Color online) The estimated (a) $P_{sub}$ and (b) $a_{ct}$, and (c) the relative uncertainty with decreasing spatial resolution. Known reference parameters are: $(P_{sub}, a_{ct}) =$ –––– (1800 Pa, 0.15), - - - (2000 Pa, 0.15), -·-· (1800 Pa, 0.20), …. (2000 Pa, 0.20). Note that the x-axis corresponds to different downsampling factors of the spatial resolution.

**FIG. 11.**
(Color online) The estimated (a) $P_{sub}$ and (b) $a_{ct}$. The known reference parameters are $(P_{sub}, a_{ct}) = (1800\ \mathrm{Pa}, 0.15)$ and the viewing angles are: –––– $0^\circ$, -·-· $7.5^\circ$, …. $10.0^\circ$.

**FIG. 12.**
(Color online) The measured glottal widths at the highest (––––) and lowest (-·-·) resolutions at $\alpha = 10^\circ$ for $(P_{sub}, a_{ct}) = (1800\ \mathrm{Pa}, 0.15)$.

**FIG. 13.**
(Color online) The relative uncertainty at (a) $\alpha = 0^\circ$ and (b) $\alpha = 10^\circ$ with changing spatial and temporal resolutions for the case $(P_{sub}, a_{ct}) = (1800\ \mathrm{Pa}, 0.15)$. Different spatial resolutions correspond to: –––– ≈ 0.0066 mm px⁻¹, -·-· ≈ 0.053 mm px⁻¹, and …. ≈ 0.21 mm px⁻¹.
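
The edge-detection step in the FIG. 3 caption (a 3 × 3 px Laplacian evaluated along one image row, with peaks marking the glottal edges) can be sketched as follows. This is a minimal illustration on a synthetic image, not the authors' processing code. The 4-neighbour kernel, the half-maximum peak threshold, and the synthetic frame are assumptions. The 0.013 mm px⁻¹ scale is borrowed from the FIG. 9 caption purely as an example conversion factor.

```python
import numpy as np

def laplacian_3x3(image):
    """Apply the 4-neighbour 3x3 Laplacian kernel [[0,1,0],[1,-4,1],[0,1,0]]
    with edge-replicated borders (kernel choice is an assumption)."""
    p = np.pad(image.astype(float), 1, mode="edge")
    return (p[:-2, 1:-1] + p[2:, 1:-1]      # neighbours above and below
            + p[1:-1, :-2] + p[1:-1, 2:]    # neighbours left and right
            - 4.0 * p[1:-1, 1:-1])          # centre pixel

def glottal_width_px(image, row):
    """Assumed rule: the dark glottal gap spans the leftmost and rightmost
    strong positive Laplacian peaks in the chosen row."""
    lap_row = laplacian_3x3(image)[row]
    threshold = 0.5 * lap_row.max()         # hypothetical half-maximum threshold
    if threshold <= 0.0:
        return 0                            # no edge found: treat the glottis as closed
    cols = np.flatnonzero(lap_row >= threshold)
    return int(cols[-1] - cols[0] + 1)

# Synthetic frame: bright tissue (1.0) with a dark 20 px wide gap (0.0).
frame = np.ones((200, 300))
frame[:, 140:160] = 0.0

MM_PER_PX = 0.013                           # example scale taken from the FIG. 9 caption
width_px = glottal_width_px(frame, row=100)
width_mm = width_px * MM_PER_PX
print(width_px, width_mm)
```

Repeating the row measurement for every frame of a video would give a glottal width waveform of the kind shown in FIG. 5. Real recordings would likely need smoothing before the Laplacian, since second derivatives amplify pixel noise.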

Cited by

Exploring the mechanics of fundamental frequency variation during phonation onset.
Serry MA, Stepp CE, Peterson SD. Biomech Model Mechanobiol. 2023 Feb;22(1):339-356. doi: 10.1007/s10237-022-01652-8. Epub 2022 Nov 12. PMID: 36370231. Free PMC article.

The influence of flow model selection on finite element model parameter estimation using Bayesian inference.
Hadwin PJ, Erath BD, Peterson SD. JASA Express Lett. 2021 Apr;1(4):045204. doi: 10.1121/10.0004260. Epub 2021 Apr 15. PMID: 34136884. Free PMC article.

Non-Linear Image Distortions in Flexible Fiberoptic Endoscopes and their Effects on Calibrated Horizontal Measurements Using High-Speed Videoendoscopy.
Ghasemzadeh H, Deliyski DD. J Voice. 2022 Nov;36(6):755-769. doi: 10.1016/j.jvoice.2020.08.029. Epub 2020 Sep 18. PMID: 32958427. Free PMC article.

Voice Feature Selection to Improve Performance of Machine Learning Models for Voice Production Inversion.
Zhang Z. J Voice. 2023 Jul;37(4):479-485. doi: 10.1016/j.jvoice.2021.03.004. Epub 2021 Apr 11. PMID: 33849760. Free PMC article.

Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model.
Ibarra EJ, Parra JA, Alzamendi GA, Cortés JP, Espinoza VM, Mehta DD, Hillman RE, Zañartu M. Front Physiol. 2021 Sep 1;12:732244. doi: 10.3389/fphys.2021.732244. eCollection 2021. PMID: 34539451. Free PMC article.

References

1. Lucero J. C., Lourenço K. G., Hermant N., Van Hirtum A., and Pelorson X., “Effect of source–tract acoustical coupling on the oscillation onset of the vocal folds,” J. Acoust. Soc. Am. 132, 403–411 (2012). doi: 10.1121/1.4728170
2. Ruty N., Pelorson X., Van Hirtum A., Lopez-Arteaga I., and Hirschberg A., “An in vitro setup to test the relevance and the accuracy of low-order vocal folds models,” J. Acoust. Soc. Am. 121, 479–490 (2007). doi: 10.1121/1.2384846
3. Zañartu M., Galindo G. E., Erath B. D., Peterson S. D., Wodicka G. R., and Hillman R. E., “Modeling the effects of a posterior glottal opening on vocal fold dynamics with implications for vocal hyperfunction,” J. Acoust. Soc. Am. 136, 3262–3271 (2014). doi: 10.1121/1.4901714
4. Erath B. D., Peterson S. D., Zañartu M., Wodicka G. R., and Plesniak M. W., “A theoretical model of the pressure field arising from asymmetric intraglottal flows applied to a two-mass model of the vocal folds,” J. Acoust. Soc. Am. 130, 389–403 (2011). doi: 10.1121/1.3586785
5. Erath B. D., Sommer D. E., Peterson S. D., and Zañartu M., “Nonlinearities in block-type reduced-order vocal fold models with asymmetric tissue properties,” Proc. Mtg. Acoust. 19, 060243 (2013). doi: 10.1121/1.4800662

Grants and funding

P50 DC015446/DC/NIDCD NIH HHS/United States
