Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Aug;146(2):1492.
doi: 10.1121/1.5124256.

The effect of high-speed videoendoscopy configuration on reduced-order model parameter estimates by Bayesian inference

Affiliations

The effect of high-speed videoendoscopy configuration on reduced-order model parameter estimates by Bayesian inference

Jonathan J Deng et al. J Acoust Soc Am. 2019 Aug.

Abstract

Bayesian inference has been previously demonstrated as a viable inverse analysis tool for estimating subject-specific reduced-order model parameters and uncertainties. However, previous studies have relied upon simulated glottal area waveforms with superimposed random noise as the measurement. In practice, high-speed videoendoscopy is used to measure glottal area, which introduces practical imaging effects not captured in simulated data, such as viewing angle, frame rate, and camera resolution. Herein, high-speed videos of the vocal folds were approximated by recording the trajectories of physical vocal fold models controlled by a symmetric body-cover model. Twenty videos were recorded, varying subglottal pressure, cricothyroid activation, and viewing angle, with frame rate and video resolution varied by digital video manipulation. Bayesian inference was used to estimate subglottal pressure and cricothyroid activation from glottal area waveforms extracted from the videos. The resulting estimates show off-axis viewing of 10° can lead to a 10% bias in the estimated subglottal pressure. A viewing model is introduced such that viewing angle can be included as an estimated parameter, which alleviates estimate bias. Frame rate and pixel resolution were found to primarily affect uncertainty of parameter estimates up to a limit where spatial and temporal resolutions were too poor to resolve the glottal area. Since many high-speed cameras have the ability to sacrifice spatial for temporal resolution, the findings herein suggest that Bayesian inference studies employing high-speed video should increase temporal resolutions at the expense of spatial resolution for reduced estimate uncertainties.

PubMed Disclaimer

Figures

FIG. 1.
FIG. 1.
A schematic of the simulated HSV experimental setup. A pair of rigid two-dimensional VF medial surfaces (a coronal cross-section) are driven by a motion system that provides one translational and one rotational degree of freedom, as in the bar-plate VF model (Ref. 7). A consumer DSLR is used to capture the VF motion. A calibration plate with two dots, shown as circles, allow alignment of the camera view along a specified angle α.
FIG. 2.
FIG. 2.
The mapping procedure connects points on the BCM to points on the physical VF geometry. Each physical VF is controlled by two degrees of freedom (s and θ). These are mapped to the cover mass displacements (xu and xl) of the BCM. The rigid VF dimensions are given by: Lu=17.07mm,Ll=15.11mm,αu=27.71°,Rl=22.23mm, and Ru=14.29mm.
FIG. 3.
FIG. 3.
(Color online) (a) Sample image extracted from a sample video recording. (b) The Laplacian computed using a 3 × 3 px kernel over row 100 of the image in (a). The open and closed circles are peaks of the Laplacian.
FIG. 4.
FIG. 4.
(Color online) The time averaged variance of the glottal width over rows of frames normalized by the variance at the highest resolution for various cases of known parameters and angles of view. Each point corresponds to the time averaged variance computed for one of the known (Psub,act) cases at each of the angles of view α=(0°,2.5°,5.0°).
FIG. 5.
FIG. 5.
(Color online) Sample glottal width waveform extracted from HSV for (Psub,act)=(1800Pa,0.15) in physiological dimensions.
FIG. 6.
FIG. 6.
(Color online) Observed glottal width as a function of camera viewing angle α for (Psub, act) = (1800 Pa, 0.15). Angles of view correspond to: –––– 0°, - - - 5°, -·-· 10°.
FIG. 7.
FIG. 7.
(Color online) The estimated (a) Psub and (b) act with increasing viewing angle. (c) The relative uncertainty with increasing viewing angle. Known reference parameters are: (Psub, act) = –––– (1800 Pa, 0.15), - - - (2000 Pa, 0.15), -·-· (1800 Pa, 0.20), …. (2000 Pa, 0.20).
FIG. 8.
FIG. 8.
(Color online) The estimated (a) Psub and (b) act with decreasing frame rate. (c) The relative uncertainty with decreasing frame rate. Known reference parameters are: (Psub,act)= –––– (1800 Pa, 0.15), - - - (2000 Pa, 0.15), -·-· (1800 Pa, 0.20), …. (2000 Pa, 0.20).
FIG. 9.
FIG. 9.
(Color online) The measured glottal width waveform for various degrees of spatial downsampling. Each resolution corresponds to a magnification factor of approximately: –––– 0.013 mm px−1, - - - 0.053 mm px−1, -·-· 0.21 mm px−1 for (Psub, act) = (1800 Pa, 0.15).
FIG. 10.
FIG. 10.
(Color online) The estimated (a) Psub and (b) act, and (c) relative uncertainty with decreasing spatial resolution. Known reference parameters are: (Psub, act) = ––––– (1800 Pa, 0.15), - - - (2000 Pa, 0.15), -·-· (1800 Pa, 0.20), …. (2000 Pa, 0.20). Note that the x axis corresponds to different downsampling factors of the spatial resolution.
FIG. 11.
FIG. 11.
(Color online) The estimated (a) Psub and (b) act. The known reference parameters are: (Psub,act)=(1800Pa,0.15) and viewing angles are: ––––– 0°, -·-· 7.5°, …. 10.0°.
FIG. 12.
FIG. 12.
(Color online) The measured glottal widths at the highest —- and lowest -·-· resolutions at α=10° for (Psub, act) = (1800 Pa, 0.15).
FIG. 13.
FIG. 13.
(Color online) The relative uncertainty at (a) α=0° and (b) α=10° with changing spatial and temporal resolutions for the case (Psub,act)=(1800Pa,0.15). Different spatial resolutions correspond to: –––– ≈ 0.0066 [mm px−1], -·-· ≈ 0.053 [mm px−1] and …. ≈ 0.21 [mm px−1].

Similar articles

Cited by

References

    1. Lucero J. C., Lourenço K. G., Hermant N., Van Hirtum A., and Pelorson X., “ Effect of source–tract acoustical coupling on the oscillation onset of the vocal folds,” J. Acoust. Soc. Am. 132, 403–411 (2012).10.1121/1.4728170 - DOI - PubMed
    1. Ruty N., Pelorson X., Van Hirtum A., Lopez-Arteaga I., and Hirschberg A., “ An in vitro setup to test the relevance and the accuracy of low-order vocal folds models,” J. Acoust. Soc. Am. 121, 479–490 (2007).10.1121/1.2384846 - DOI - PubMed
    1. Zañartu M., Galindo G. E., Erath B. D., Peterson S. D., Wodicka G. R., and Hillman R. E., “ Modeling the effects of a posterior glottal opening on vocal fold dynamics with implications for vocal hyperfunction,” J. Acoust. Soc. Am. 136, 3262–3271 (2014).10.1121/1.4901714 - DOI - PMC - PubMed
    1. Erath B. D., Peterson S. D., Zañartu M., Wodicka G. R., and Plesniak M. W., “ A theoretical model of the pressure field arising from asymmetric intraglottal flows applied to a two-mass model of the vocal folds,” J. Acoust. Soc. Am. 130, 389–403 (2011).10.1121/1.3586785 - DOI - PubMed
    1. Erath B. D., Sommer D. E., Peterson S. D., and Zañartu M., “ Nonlinearities in block-type reduced-order vocal fold models with asymmetric tissue properties,” Proc. Mtg. Acoust. 19, 060243 (2013).10.1121/1.4800662 - DOI - PubMed

Publication types