Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Sep 12.
doi: 10.1007/s00455-025-10879-3. Online ahead of print.

Using Machine Learning for the Automated Segmentation and Detection of Swallows Obtained by Digital Cervical Auscultation in Preterm Neonates

Affiliations

Using Machine Learning for the Automated Segmentation and Detection of Swallows Obtained by Digital Cervical Auscultation in Preterm Neonates

Stephen So et al. Dysphagia. .

Abstract

The clinical application of acoustic swallowing sound parameters collected from digital cervical auscultation is limited because of the time-consuming manual segmentation required by trained experts. The automated identification of swallowing sounds in children and adults from swallowing sound audio wavefiles using machine learning have accuracies between 76-95%. No data exists in preterm neonates. To determine if applying automated machine learning using a transfer learning approach could accurately identify and segment swallows from swallowing sounds collected in preterm neonates. Thin fluid swallow sounds were collected from 78 preterm neonates, median birth age 34 weeks gestation (range 25-36 weeks, 52.6% males) across 3 Australian special care nurseries. For the base machine learning model, a deep convolutional neural network (DCNN) pre-trained for audio event classification was used. With raw swallow audio data as input, embedding vectors from the base DCNN were generated and used to train a feedforward neural network to determine the presence of a swallow within an audio segment. The model showed high overall accuracy (94%) in identifying preterm swallows. Better model performance on bottle feeding swallows (Sensitivity, 95%; and specificity, 96%) was seen compared with breastfeeding swallows (sensitivity, 95%, specificity 92%). Interpretation: Our novel study demonstrates the successful use of transfer learning to accurately identify and segment digital swallowing sounds in preterm neonates. Application of this model could support the development of a digital CA app to automatically classify swallow sounds and improve objectivity for CA use in clinical practice within special care nurseries.

Keywords: Cervical auscultation; Deglutition; Machine learning; Neonate; Preterm; Signal processing; Swallowing sounds.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors have a patent registration of the original algorithm used in this study.

References

    1. Mercado-Deane M-G, Burton EM, Harlow SA, et al. Swallowing dysfunction in infants less than 1 year of age. Pediatr Radiol. 2001;31(6):423–8. - PubMed
    1. Uhm KE, Yi S-H, Chang HJ, Cheon HJ, Kwon J-Y. Videofluoroscopic swallowing study findings in full-term and preterm infants with dysphagia. Ann Rehabil Med. 2013;37(2):175–82. - PubMed - PMC
    1. Patel T, Clemmens C, Bradburn K, et al. Effect of a standardized fluoroscopic procedural approach on fluoroscopy time during infant modified barium swallow studies. Int J Pediatr Otorhinolaryngol. 2020;138:110396–110396. - PubMed
    1. Sheikh S, Allen E, Shell R, et al. Chronic aspiration without gastroesophageal reflux as a cause of chronic respiratory symptoms in neurologically normal infants. Chest. 2001;120(4):1190–5. - PubMed
    1. Tutor JD, Srinivasan S, Gosa MM, Spentzas T, Stokes DC. Pulmonary function in infants with swallowing dysfunction. PLoS ONE. 2015;10(5): e0123125. - PubMed - PMC

LinkOut - more resources