Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Nov 24;15(1):45225.
doi: 10.1038/s41598-025-28930-6.

Spectral peak picking improves tactile speech perception

Affiliations

Spectral peak picking improves tactile speech perception

Mark D Fletcher et al. Sci Rep. .

Abstract

Individual differences in speech perception often arise from disparities in access to acoustic-phonetic cues, particularly among those with hearing loss. Haptic hearing aids, which convey speech information through the sense of touch, offer a complementary pathway to improve speech understanding. However, effectively transmitting critical speech features through vibrotactile stimulation remains challenging.To address this challenge, we introduce a tactile spectral peak picking (tSPP) approach, integrated into a vocoder-based audio-to-tactile conversion algorithm to enhance vibrotactile phoneme discrimination. The tactile vocoder decomposes audio into eight frequency bands, with tSPP selectively transmitting only the most energetic bands to emphasise dominant spectral features.Tactile phoneme discrimination on the wrist was tested in 26 participants using either the tactile vocoder alone or with the tSPP algorithm selecting one, two, or four peaks. Discrimination improved significantly when one, two, or four peaks were selected relative to the vocoder alone, with the greatest benefits observed for one- and two-peak tSPP (average improvement: 7.5%).These findings demonstrate that selective enhancement of spectrally salient features can improve tactile speech perception. The algorithm is suitable for real-time use in wearable sensory substitution devices and could aid the development of effective haptic hearing aids.

Keywords: Audio-tactile; Cochlear implant; Hearing aid; Multisensory; Speech reading; Tactile aid.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Phoneme discrimination performance for different numbers of peaks picked with tSPP. Results are shown separately for consonants and vowels and for the male (left panel) and female (right panel) talkers. Each box plot displays the median (horizontal line), upper and lower quartiles (box edges), and outliers (unfilled circles; defined as values more than 1.5 times the interquartile range). Whiskers indicate the full non-outlier range. Chance-level performance is marked by a dashed grey line.
Fig. 2
Fig. 2
Performance for different consonant (left panel) and vowels (right panel) subgroups, under each tSPP condition. The box plots follow the format of Fig. 1.
Fig. 3
Fig. 3
Spectrograms showing the audio (left panel) and the tactile envelopes with either one- or two-peak tSPP (central and right panels, respectively) for the voiced fricative consonant /z/, spoken by the female talker. The frequency band limits used for the tactile vocoder in the current experiment are marked with light blue dashed lines. The audio spectrogram sample rate was 22.05 kHz, and a 30-ms Hamming window with a hop size of 1 sample was used. The sample rate for the tactile spectrograms was 500 Hz, with no windowing applied. For the input audio, intensity is shown in decibels relative to the maximum magnitude of the short-time Fourier transform. For the tactile envelopes, intensity is shown in decibels relative to the maximum envelope amplitude.
Fig. 4
Fig. 4
Block diagram showing the tactile vocoder signal-processing chain used to convert sound into vibration.
Fig. 5
Fig. 5
3D renders of the EHS Research Group haptic stimulation rig. Left: the rig without the participant’s arm. Right: close-up view with the participant’s arm resting on the blue foam cushion and the shaker probe contacting the dorsal wrist. Image reproduced from Fletcher et al. with permission of the authors.

References

    1. Fletcher, M. D., Akis, E., Verschuur, C. A. & Perry, S. W. Improved tactile speech perception and noise robustness using audio-to-tactile sensory substitution with amplitude envelope expansion. Sci. Rep.14, 15029. 10.1038/s41598-024-65510-6 (2024). - DOI - PMC - PubMed
    1. Fletcher, M. D., Verschuur, C. A. & Perry, S. W. Improving speech perception for hearing-impaired listeners using audio-to-tactile sensory substitution with multiple frequency channels. Sci. Rep.13, 13336. 10.1038/s41598-023-40509-7 (2023). - DOI - PMC - PubMed
    1. Brooks, P. L., Frost, B. J., Mason, J. L. & Chung, K. Acquisition of a 250-word vocabulary through a tactile vocoder. J. Acoust. Soc. Am.77, 1576–1579. 10.1121/1.392000 (1985). - DOI - PubMed
    1. Fletcher, M. D., Hadeedi, A., Goehring, T. & Mills, S. R. Electro-haptic enhancement of speech-in-noise performance in cochlear implant users. Sci. Rep.9, 11428. 10.1038/s41598-019-47718-z (2019). - DOI - PMC - PubMed
    1. Fletcher, M. D., Song, H. & Perry, S. W. Electro-haptic stimulation enhances speech recognition in spatially separated noise for cochlear implant users. Sci. Rep.10, 12723. 10.1038/s41598-020-69697-2 (2020). - DOI - PMC - PubMed

LinkOut - more resources