Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jan 7;21(1):e1012108.
doi: 10.1371/journal.pcbi.1012108. eCollection 2025 Jan.

Bayesian active sound localisation: To what extent do humans perform like an ideal-observer?

Affiliations

Bayesian active sound localisation: To what extent do humans perform like an ideal-observer?

Glen McLachlan et al. PLoS Comput Biol. .

Abstract

Self-motion is an essential but often overlooked component of sound localisation. As the directional information of a source is implicitly contained in head-centred acoustic cues, that acoustic input needs to be continuously combined with sensorimotor information about the head orientation in order to decode to a world-centred frame of reference. When utilised, head movements significantly reduce ambiguities in the directional information provided by the incoming sound. In this work, we model human active sound localisation (considering small head rotations) as an ideal observer. In the evaluation, we compared human performance obtained in a free-field active localisation experiment with the predictions of a Bayesian model. Model noise parameters were set a-priori based on behavioural results from other studies, i.e., without any post-hoc parameter fitting to behavioural results. The model predictions showed a general agreement with actual human performance. However, a spatial analysis revealed that the ideal observer was not able to predict localisation behaviour for each source direction. A more detailed investigation into the effects of various model parameters indicated that uncertainty on head orientation significantly contributed to the observed differences. Yet, the biases and spatial distribution of the human responses remained partially unexplained by the presented ideal observer model, suggesting that human sound localisation is sub-optimal.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Example posterior distribution of source direction at different time steps during yaw rotation.
Darker areas indicate higher probabilities. The blue ‘x’ is the true source direction.
Fig 2
Fig 2. Centroid and Kent distribution for condition BP and source direction (30°, 30°).
Black dots are the individual subject responses, from which the Kent distribution was calculated.
Fig 3
Fig 3. Centroids and Kent distributions of behavioural (B) and modelled (M) responses in the passive (P) and active (A) conditions, averaged over eight subjects.
The rows show the same data viewed towards the front, the right, and the back of the head. Quadrant errors were excluded.
Fig 4
Fig 4. Quadrant error rates [%] of behavioural (B) and modelled (M) responses in the passive (P) and active (A) conditions, averaged over eight subjects.
The rows show the same data viewed towards the front and the back of the head.
Fig 5
Fig 5. Lateral RMSE, polar RMSE and QE rate of the modelled data as a function of σitd (in units of the JND).
Blue markers are passive results, orange markers are active results. The markers and the error bars represent the mean and standard deviation over the eight modelled subjects. For reference, the dashed lines and the coloured areas show the behavioural means and standard deviations over the eight subjects, respectively.
Fig 6
Fig 6. Lateral RMSE, polar RMSE and QE rate of the modelled data as a function head orientation measurement noise σH, with head control noise σu = 0° (left column) and σu = 8°.
Blue markers are passive results, orange markers are active results. The markers and the error bars represent the mean and standard deviation over the eight modelled subjects. For reference, the dashed lines and the coloured areas show the behavioural means and standard deviations over the eight subjects, respectively.
Fig 7
Fig 7. Elevation gain [24] and QE rate of the modelled data as a function of σp (in degrees).
Blue markers are passive results, orange markers are dynamic results. The markers and the error bars represent the mean and standard deviation over the eight modelled subjects. For reference, the dashed lines and the coloured areas show the behavioural means and standard deviations over the eight subjects, respectively.
Fig 8
Fig 8. Lateral RMSE, polar RMSE, and QE rates of the modelled data as a function of time step size Δt.
The symbols show the averages and the error bars represent ±1 SDs over the (virtual) subjects. For reference, the horizontal dashed lines show the behavioural data.

Similar articles

References

    1. Brimijoin WO, Boyd AW, Akeroyd MA. The contribution of head movement to the externalization and internalization of sounds. PloS one. 2013;8(12):e83068. doi: 10.1371/journal.pone.0083068 - DOI - PMC - PubMed
    1. Wallach H. The role of head movements and vestibular and visual cues in sound localization. Journal of Experimental Psychology. 1940;27(4):339. doi: 10.1037/h0054629 - DOI
    1. Perrett S, Noble W. The contribution of head motion cues to localization of low-pass noise. Perception & psychophysics. 1997;59(7):1018–1026. doi: 10.3758/BF03205517 - DOI - PubMed
    1. Jiang J, Xie B, Mai H, Liu L, Yi K, Zhang C. The role of dynamic cue in auditory vertical localisation. Applied Acoustics. 2019;146:398–408. doi: 10.1016/j.apacoust.2018.12.002 - DOI
    1. Giguère C, Abel SM. Sound localization: Effects of reverberation time, speaker array, stimulus frequency, and stimulus rise/decay. The Journal of the Acoustical Society of America. 1993;94(2):769–776. doi: 10.1121/1.408206 - DOI - PubMed

LinkOut - more resources