J Voice. 2024 Sep 20:S0892-1997(24)00283-2.
doi: 10.1016/j.jvoice.2024.08.029. Online ahead of print.

Evidence-Based Recommendations for Tablet Recordings From the Bridge2AI-Voice Acoustic Experiments

Shaheen N Awan et al. J Voice.

Abstract

Background: As part of a larger goal to create best practices for voice data collection to fuel voice artificial intelligence (AI) research, the objective of this study was to investigate the ability of readily available iOS and Android tablets with and without low-cost headset microphones to produce recordings and subsequent acoustic measures of voice comparable to "research quality" instrumentation.

Methods: Recordings of 24 sustained vowel samples representing a wide range of typical and disordered voices were played via a head-and-torso model and recorded using a research-quality standard microphone/preamplifier/audio interface. Acoustic measurements from the standard were compared with those from two popular tablets, recorded using their built-in microphones and using low-cost headset microphones, at different distances from the mouth.

Results: Voice measurements obtained via tablets + headset microphones close to the mouth (2.5 and 5 cm) strongly correlated (r's > 0.90) with the research standard and resulted in no significant differences for measures of vocal frequency and perturbation. In contrast, voice measurements obtained using the tablets' built-in microphones at typical reading distances (30 and 45 cm) tended to show substantial variability in measurement, greater mean differences in voice measurements, and relatively poorer correlations vs the standard.
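The kind of comparison reported above (correlation with the standard plus a check on mean differences) can be illustrated with a minimal Python sketch. This is not the authors' analysis: the abstract does not name the statistical test used, so the paired t statistic here is an assumption, the function names are hypothetical, and the fundamental-frequency values are invented placeholders rather than data from the study.

```python
from math import sqrt
from statistics import mean, stdev


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def compare_to_standard(standard, device):
    """Compare one acoustic measure from a device against the standard.

    Returns the Pearson r, the mean paired difference (device - standard),
    and a paired t statistic (assumed test; not stated in the abstract).
    """
    if len(standard) != len(device) or len(standard) < 3:
        raise ValueError("need two equal-length series with at least 3 samples")
    diffs = [d - s for s, d in zip(standard, device)]
    mean_diff = mean(diffs)
    t_stat = mean_diff / (stdev(diffs) / sqrt(len(diffs)))
    return {"r": pearson_r(standard, device), "mean_diff": mean_diff, "t": t_stat}


# Invented placeholder values (Hz) for illustration only; NOT study data.
standard_f0 = [110.0, 125.5, 198.2, 210.7, 240.1, 95.3]
headset_f0 = [110.2, 125.3, 198.5, 210.6, 240.4, 95.2]

result = compare_to_standard(standard_f0, headset_f0)
print(result)
```

In a sketch like this, a device configuration would be judged against the standard by looking at both numbers together: a high r alone does not rule out a systematic offset, which is what the mean paired difference is meant to expose.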

Conclusion: Findings from this study support the Bridge2AI-Voice Consortium's preliminary recommendation to use smartphones paired with low-cost headset microphones as an adequate method of recording for large-scale voice data collection from a variety of clinical and nonclinical settings. Compared with recording directly with a tablet, a headset microphone controls for recording distance and reduces the effects of background noise, resulting in decreased variability in recording quality.

Data availability: Data supporting the results reported in this article may be obtained upon request from the contact author.

Keywords: AI research; Voice; Acoustic analysis; Perturbation; Cepstral analysis.

Conflict of interest statement

Declaration of Competing Interest Nothing to disclose.
