Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Nov/Dec:41 Suppl 1:131S-139S.
doi: 10.1097/AUD.0000000000000955.

Auditory Measures for the Next Billion Users

Affiliations

Auditory Measures for the Next Billion Users

Malcolm Slaney et al. Ear Hear. 2020 Nov/Dec.

Abstract

A range of new technologies have the potential to help people, whether traditionally considered hearing impaired or not. These technologies include more sophisticated personal sound amplification products, as well as real-time speech enhancement and speech recognition. They can improve user's communication abilities, but these new approaches require new ways to describe their success and allow engineers to optimize their properties. Speech recognition systems are often optimized using the word-error rate, but when the results are presented in real time, user interface issues become a lot more important than conventional measures of auditory performance. For example, there is a tradeoff between minimizing recognition time (latency) by quickly displaying results versus disturbing the user's cognitive flow by rewriting the results on the screen when the recognizer later needs to change its decisions. This article describes current, new, and future directions for helping billions of people with their hearing. These new technologies bring auditory assistance to new users, especially to those in areas of the world without access to professional medical expertise. In the short term, audio enhancement technologies in inexpensive mobile forms, devices that are quickly becoming necessary to navigate all aspects of our lives, can bring better audio signals to many people. Alternatively, current speech recognition technology may obviate the need for audio amplification or enhancement at all and could be useful for listeners with normal hearing or with hearing loss. With new and dramatically better technology based on deep neural networks, speech enhancement improves the signal to noise ratio, and audio classifiers can recognize sounds in the user's environment. Both use deep neural networks to improve a user's experiences. Longer term, auditory attention decoding is expected to allow our devices to understand where a user is directing their attention and thus allow our devices to respond better to their needs. In all these cases, the technologies turn the hearing assistance problem on its head, and thus require new ways to measure their performance.

PubMed Disclaimer

References

    1. American Academy of Audiology(2018). The audiologist’s guide to hearing aids, PSAPs, hearables and OTC devices. https://www.audiology.org/sites/default/files/publications/resources/201...
    1. Apple. (2020). iPhone Accessibility. Retrieved from https://www.apple.com/accessibility/iphone/hearing/, Retrieved July 6, 2020
    1. Arora P. The Next Billion Users: Digital Life Beyond the West, (2019). Harvard University Press.
    1. Bisgaard N., Vlaming M. S., Dahlquist M. Standard audiograms for the IEC 60118-15 measurement procedure. Trends Amplif, (2010). 14, 113–120
    1. Bishop T. 2017). Microsoft claims new speech recognition record, achieving a super-human 5.1% error rate. Geekwire. https://www.geekwire.com/2017/microsoft-claims-new-speech-recognition-re...

Publication types

LinkOut - more resources