Front Neurosci. 2018 Feb 2;12:21.
doi: 10.3389/fnins.2018.00021. eCollection 2018.

Generic HRTFs May be Good Enough in Virtual Reality. Improving Source Localization through Cross-Modal Plasticity

Christopher C Berger et al. Front Neurosci.

Abstract

Auditory spatial localization in humans is performed using a combination of interaural time differences, interaural level differences, and spectral cues provided by the geometry of the ear. To render spatialized sounds within a virtual reality (VR) headset, either individualized or generic Head Related Transfer Functions (HRTFs) are usually employed. The former require arduous calibrations, but enable accurate auditory source localization, which may lead to a heightened sense of presence within VR. The latter obviate the need for individualized calibrations, but result in less accurate auditory source localization. Previous research on auditory source localization in the real world suggests that our representation of acoustic space is highly plastic. In light of these findings, we investigated whether auditory source localization could be improved for users of generic HRTFs via cross-modal learning. The results show that pairing a dynamic auditory stimulus with a spatio-temporally aligned visual counterpart enabled users of generic HRTFs to improve subsequent auditory source localization. Exposure to the auditory stimulus alone, or to asynchronous audiovisual stimuli, did not improve auditory source localization. These findings have important implications for human perception as well as for the development of VR systems, as they indicate that generic HRTFs may be enough to enable good auditory source localization in VR.
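To make the rendering step concrete, the sketch below illustrates one common way HRTF-based spatialization is implemented: a mono source is convolved with the left/right head-related impulse response (HRIR) pair measured for the desired direction. This is a minimal illustration, not the authors' implementation; the HRIRs here are random placeholders, and a real renderer would load a measured (individualized or generic) set.

    import numpy as np

    def spatialize(mono, hrir_left, hrir_right):
        """Convolve a mono signal with an HRIR pair to get a binaural stereo signal."""
        left = np.convolve(mono, hrir_left)
        right = np.convolve(mono, hrir_right)
        return np.stack([left, right], axis=-1)  # shape: (n_samples, 2)

    # Hypothetical example: a 440 Hz tone and toy 128-tap "HRIRs".
    fs = 44100
    t = np.arange(fs) / fs
    tone = np.sin(2 * np.pi * 440 * t)
    rng = np.random.default_rng(0)
    stereo = spatialize(tone, rng.standard_normal(128), rng.standard_normal(128))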

Keywords: HRTF (head related transfer function); auditory perception; auditory training; cross-modal perception; cross-modal plasticity; spatial audio; virtual reality.

Figures

Figure 1
Experimental setup. (A) The participants were equipped with the VR headset and could identify and report the source of sounds originating from five different locations (±26.6°, ±11.3°, 0°) along a white bar that was located 10 m in front of the participant and spanned 73.74° along the azimuth. (B) First-person perspective within the VR environment during the auditory localization task. (The person in the picture is an author of the paper and gave consent to publish an identifiable image of him.)
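The reported angles are consistent with five targets at lateral offsets of 0, ±2, and ±5 m on a bar 10 m away; these offsets are inferred from the angles, not stated in the caption. The short check below reproduces the caption's numbers.

    import math

    distance = 10.0                         # m, bar in front of the participant
    offsets = [-5.0, -2.0, 0.0, 2.0, 5.0]   # m, inferred lateral target positions
    angles = [math.degrees(math.atan2(x, distance)) for x in offsets]
    print([round(a, 1) for a in angles])    # [-26.6, -11.3, 0.0, 11.3, 26.6]

    # A 15 m wide bar (half-width 7.5 m) reproduces the reported 73.74° span.
    print(round(2 * math.degrees(math.atan2(7.5, distance)), 2))  # 73.74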
Figure 2
Results from all experiments. (A) Box plots of the auditory remapping for all experiments. A significant improvement in the participants' auditory localization error was observed following the 60 s Audiovisual (AV) exposure. No such improvement was observed following the Auditory Only exposure. In the experiment on the effect of impact sounds, improved auditory source localization was observed following the synchronous audiovisual exposure phase with the additional impact-related auditory cues (AV + Impact Sync). No significant remapping was observed following exposure to asynchronous but spatially aligned audiovisual stimuli (AV + Impact Async), or when the training was done with one sound and the localization test used a different sound (V + Impact Sync). (B) Mean pre- and post-adaptation localization errors for all participants, with each participant's data represented by a pair of dots connected by a line. Asterisks indicate a significant difference between pre-exposure and post-exposure phases (*p < 0.05), and "n.s." indicates no significant difference between pre- and post-exposure phases (p > 0.05).
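The pre/post comparison in panel B amounts to a paired test on each participant's mean localization error. Below is a minimal sketch assuming a paired-samples t-test (the study's actual statistic is not stated here); the error values are made-up placeholders, not data from the paper.

    import numpy as np
    from scipy import stats

    pre_error = np.array([12.1, 10.4, 14.3, 9.8, 11.7])   # deg, hypothetical
    post_error = np.array([9.9, 8.7, 12.1, 9.5, 10.2])    # deg, hypothetical
    t_stat, p_value = stats.ttest_rel(pre_error, post_error)
    print(f"t = {t_stat:.2f}, p = {p_value:.3f}")          # significant if p < 0.05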

