Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 11;24(10):3057.
doi: 10.3390/s24103057.

Novel Approach Combining Shallow Learning and Ensemble Learning for the Automated Detection of Swallowing Sounds in a Clinical Database

Affiliations

Novel Approach Combining Shallow Learning and Ensemble Learning for the Automated Detection of Swallowing Sounds in a Clinical Database

Satoru Kimura et al. Sensors (Basel). .

Abstract

Cervical auscultation is a simple, noninvasive method for diagnosing dysphagia, although the reliability of the method largely depends on the subjectivity and experience of the evaluator. Recently developed methods for the automatic detection of swallowing sounds facilitate a rough automatic diagnosis of dysphagia, although a reliable method of detection specialized in the peculiar feature patterns of swallowing sounds in actual clinical conditions has not been established. We investigated a novel approach for automatically detecting swallowing sounds by a method wherein basic statistics and dynamic features were extracted based on acoustic features: Mel Frequency Cepstral Coefficients and Mel Frequency Magnitude Coefficients, and an ensemble learning model combining Support Vector Machine and Multi-Layer Perceptron were applied. The evaluation of the effectiveness of the proposed method, based on a swallowing-sounds database synchronized to a video fluorographic swallowing study compiled from 74 advanced-age patients with dysphagia, demonstrated an outstanding performance. It achieved an F1-micro average of approximately 0.92 and an accuracy of 95.20%. The method, proven effective in the current clinical recording database, suggests a significant advancement in the objectivity of cervical auscultation. However, validating its efficacy in other databases is crucial for confirming its broad applicability and potential impact.

Keywords: automated detection; swallowing sound.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Example of swallowing sounds detected from video fluorographic swallowing study (VFSS).
Figure 2
Figure 2
Steps from recording data to loud event detection. (a) Recording data waveform, including swallowing during jelly ingestion (red vertical line area is the swallowing-sound interval). (b) Waveform of recording data after bandpass filter processing. (c) Spectrogram for recording data waveform. (d) Recording of loud events and swallowing events (light-colored highlighted sections).
Figure 3
Figure 3
Steps for calculating the MFCC and MFMC.
Figure 4
Figure 4
Comparison of number of feature dimensions and swallowing-sound classification performance.

Similar articles

References

    1. Clavé P., Terré R., de Kraa M., Serra M. Approaching Oropharyngeal Dysphagia. Rev. Esp. Enferm. Dig. 2004;96:119–131. doi: 10.4321/S1130-01082004000200005. - DOI - PubMed
    1. Aslam M., Vaezi M.F. Dysphagia in the Elderly. Gastroenterol. Hepatol. 2013;9:784–795. - PMC - PubMed
    1. Sura L., Madhavan A., Carnaby G., Crary M.A. Dysphagia in the Elderly: Management and Nutritional Considerations. Clin. Interv. Aging. 2012;7:287–298. - PMC - PubMed
    1. Nawaz S., Tulunay-Ugur O.E. Dysphagia in the Older Patient. Otolaryngol. Clin. N. Am. 2018;51:769–777. doi: 10.1016/j.otc.2018.03.006. - DOI - PubMed
    1. Kertscher B., Speyer R., Palmieri M., Plant C. Bedside Screening to Detect Oropharyngeal Dysphagia in Patients with Neurological Disorders: An Updated Systematic Review. Dysphagia. 2014;29:204–212. doi: 10.1007/s00455-013-9490-9. - DOI - PubMed