Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 May 26;21(11):3712.
doi: 10.3390/s21113712.

Automatic Hyoid Bone Tracking in Real-Time Ultrasound Swallowing Videos Using Deep Learning Based and Correlation Filter Based Trackers

Affiliations

Automatic Hyoid Bone Tracking in Real-Time Ultrasound Swallowing Videos Using Deep Learning Based and Correlation Filter Based Trackers

Shurui Feng et al. Sensors (Basel). .

Abstract

(1) Background: Ultrasound provides a radiation-free and portable method for assessing swallowing. Hyoid bone locations and displacements are often used as important indicators for the evaluation of swallowing disorders. However, this requires clinicians to spend a great deal of time reviewing the ultrasound images. (2) Methods: In this study, we applied tracking algorithms based on deep learning and correlation filters to detect hyoid locations in ultrasound videos collected during swallowing. Fifty videos were collected from 10 young, healthy subjects for training, evaluation, and testing of the trackers. (3) Results: The best performing deep learning algorithm, Fully-Convolutional Siamese Networks (SiamFC), proved to have reliable performance in getting accurate hyoid bone locations from each frame of the swallowing ultrasound videos. While having a real-time frame rate (175 fps) when running on an RTX 2060, SiamFC also achieved a precision of 98.9% at the threshold of 10 pixels (3.25 mm) and 80.5% at the threshold of 5 pixels (1.63 mm). The tracker's root-mean-square error and average error were 3.9 pixels (1.27 mm) and 3.3 pixels (1.07 mm), respectively. (4) Conclusions: Our results pave the way for real-time automatic tracking of the hyoid bone in ultrasound videos for swallowing assessment.

Keywords: SiamFC; correlation filters; deep learning; dysphagia; hyoid bone; real-time; swallowing; tracking; ultrasound videos.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

Figure 1
Figure 1
The left side of the figure shows an example ultrasound image with labeled anatomical structures, as illustrated on the right side. The hyoid bone annotation point was placed at the intersection of the geniohyoid muscle (left) and the acoustic shadow (above). During inference, a bounding box tracked the hyoid bone location.
Figure 2
Figure 2
Workflow of Siamese trackers. The center of the yellow bounding box indicates the hyoid location in the last frame; the green box and orange box centered on the last frame hyoid location will crop exemplar patch and search patch on the previous frame and current frame, respectively. The green point indicates the peak response on the score map, while the yellow one denotes the center location of the current frame.
Figure 3
Figure 3
Workflow of multi-stage trackers. Precise region of interests (PrRoI) pooling layers [35] can convert features of different sizes into the same size while enabling the computation of the gradient of Intersection over Union (IoU) with respect to the bounding box coordinates. IoU Net outputs IoU scores for each proposal, and the top three ranked proposals are averaged to produce a robust prediction bounding box location.
Figure 4
Figure 4
Workflow of correlation filter trackers.
Figure 5
Figure 5
Hyoid center trace plots between the timestamps of hyoid onset (the frame when the hyoid starts to move) to offset (at the moment when the hyoid starts to move away from its maximum position of superior-anterior movement) in 2D Cartesian axis (a), polar axis (b), x-axis (c), and y-axis (d). From an example test sequence of 10 mL thin liquid swallow, female subject, from hyoid onset to offset. The blue line represents the ground truth of the hyoid path, and the yellow one represents the inference path. Length unit at the polar axis is in pixels.
Figure 6
Figure 6
Precision plot shows the mean distance precision of 10 test sequences in full-length at different location error thresholds. The legend shows the precisions of different trackers at the threshold of 10 pixels.
Figure 7
Figure 7
Performance plot. Center error of all frames from a test sequence in full-length where the female swallows 10 mL of paste liquid. The y-axis is a center error in pixel and the x-axis is frame number. Three example images from every 200 frames were chosen and displayed above the plot. The pink dot represents the center of inference, and the blue dot represents the center of ground truth (annotation).
Figure 8
Figure 8
A frame in which two acoustic shadows of hyoid bone were seen due to fast hyoid movement speed. Ground truth location (blue dot) and prediction (pink dot with pink bounding box).

Similar articles

Cited by

References

    1. Smithard D.G. Dysphagia: A Geriatric Giant? Med. Clin. Rev. 2016;2:1–7. doi: 10.21767/2471-299X.1000014. - DOI
    1. Bloem B.R., Lagaay A.M., Van Beek W., Haan J., Roos R.A., Wintzen A.R. Prevalence of subjective dysphagia in community residents aged over 87. BMJ. 1990;300:721. doi: 10.1136/bmj.300.6726.721. - DOI - PMC - PubMed
    1. Humbert I.A., Robbins J. Dysphagia in the elderly. Phys. Med. Rehabil. Clin. N. Am. 2008;19:853–866. doi: 10.1016/j.pmr.2008.06.002. - DOI - PMC - PubMed
    1. O’Loughlin G., Shanley C. Swallowing problems in the nursing home: A novel training response. Dysphagia. 1998;13:172–183. doi: 10.1007/PL00009569. - DOI - PubMed
    1. Martino R., Foley N., Bhogal S., Diamant N., Speechley M., Teasell R. Dysphagia after stroke: Incidence, diagnosis, and pulmonary complications. Stroke. 2005;36:2756–2763. doi: 10.1161/01.STR.0000190056.76543.eb. - DOI - PubMed