Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb:122:108289.
doi: 10.1016/j.patcog.2021.108289. Epub 2021 Aug 30.

AI-Based human audio processing for COVID-19: A comprehensive overview

Affiliations

AI-Based human audio processing for COVID-19: A comprehensive overview

Gauri Deshpande et al. Pattern Recognit. 2022 Feb.

Abstract

The Coronavirus (COVID-19) pandemic impelled several research efforts, from collecting COVID-19 patients' data to screening them for virus detection. Some COVID-19 symptoms are related to the functioning of the respiratory system that influences speech production; this suggests research on identifying markers of COVID-19 in speech and other human generated audio signals. In this article, we give an overview of research on human audio signals using 'Artificial Intelligence' techniques to screen, diagnose, monitor, and spread the awareness about COVID-19. This overview will be useful for developing automated systems that can help in the context of COVID-19, using non-obtrusive and easy to use bio-signals conveyed in human non-speech and speech audio productions.

Keywords: Audio processing; COVID-19; Computational paralinguistics; Digital health.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
Capturing and processing audio signals including speech for COVID-19 applications.
Fig. 2
Fig. 2
Groups (given on the x-axis) that collected and analysed cough, speech, and breathing data as indicated. Although some groups collected all three types of data, they have reported their results based on the analysis of only one of them. The y-axis indicates the frequencies of the healthy and C19 subjects present in the data set. Coughvid, VoiceMed and Spira have reported number of data points; we report here number of subjects. The data sets from Cambridge, Coswara, and Coughvid are publicly available. C & B: Cough & Breath; C, S & B: Cough, Speech & Breath. On the x-axis, reference to bibliography is given in square brackets & number without bracket refers to footnote.
Fig. 3
Fig. 3
Acoustic features’ & Machine learning techniques’ usage with the performance reported by different groups (on x-axis) for detecting COVID-19. The first row ‘+C19 subjects’ gives the C19 positive subjects’ count used by the respective groups; sequence of groups same as in Fig. 2. The features used by each group are indicated by the block colour: MFCC; SG: Spectrograms; VFO: Vocal fold Vibrations. Performance reported in the form of A: Accuracy, Se: Sensitivity, Sp: Specificity, and AUC. LR: Logistic regression. ‘Coswara’ and ‘Coughvid’ have not done any analysis with the data set they collected, hence blank blocks are shown for them. The results reported by ’Cambridge’ are: Combined analysis using cough and breath, C: Cough only and B: Breath only. On the x-axis, reference to bibliography is given in square brackets & number without bracket refers to footnote.

References

    1. Hanson K.E., Caliendo A.M., Arias C.A., Englund J.A., Lee M.J., Loeb M., Patel R., El Alayli A., Kalot M.A., Falck-Ytter Y., Lavergne V., Morgan R.L., Murad M.H., Sultan S., Bhimraj A., Mustafaand R.A. Infectious diseases society of america guidelines on the diagnosis of COVID-19. Clin. Infect. Dis. 2020:1–27. - PMC - PubMed
    1. B. W. Schuller, D. M. Schuller, K. Qian, J. Liu, H. Zheng, X. Li, COVID-19 and computer audition: an overview on what speech & sound analysis could contribute in the SARS-cov-2 corona crisis, arXiv:2003.11117 (2020). - PMC - PubMed
    1. G. Deshpande, B. Schuller, An overview on audio, signal, speech, & language processing for COVID-19, arXiv:2005.08579 (2020).
    1. Drugman T., Urbain J., Bauwens N., Chessini R., Valderrama C., Lebecque P., Dutoit T. Objective study of sensor relevance for automatic cough detection. IEEE J. Biomed. Health Inform. 2013;17(3):699–707. - PubMed
    1. Pramono R.X.A., Imtiaz S.A., Rodriguez-Villegas E. Proceedings of the 41st Annual International Conference of the Engineering in Medicine and Biology Society (EMBC) IEEE; Berlin, Germany: 2019. Automatic cough detection in acoustic signal using spectral features; pp. 7153–7156. - PubMed

LinkOut - more resources