Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2012 Apr 5;367(1591):954-64.
doi: 10.1098/rstb.2011.0365.

Multistability, cross-modal binding and the additivity of conjoined grouping principles

Affiliations
Review

Multistability, cross-modal binding and the additivity of conjoined grouping principles

Michael Kubovy et al. Philos Trans R Soc Lond B Biol Sci. .

Abstract

We present a sceptical view of multimodal multistability--drawing most of our examples from the relation between audition and vision. We begin by summarizing some of the principal ways in which audio-visual binding takes place. We review the evidence that unambiguous stimulation in one modality may affect the perception of a multistable stimulus in another modality. Cross-modal influences of one multistable stimulus on the multistability of another are different: they have occurred only in speech perception. We then argue that the strongest relation between perceptual organization in vision and perceptual organization in audition is likely to be by way of analogous Gestalt laws. We conclude with some general observations about multimodality.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Köhler's demonstration of binding by common embodiment. (a) The maluma figure. (b) The takete figure.
Figure 2.
Figure 2.
Examples of coherent and unrelated ambiguities: the Adams & Haire [21] nested cubes. (a) Same orientations. (b) Opposite orientations.
Figure 3.
Figure 3.
An example of unrelated ambiguities: Ambiguity 1: is the Necker cube interpreted as a cube seen from a vantage point above to its right or from a vantage point below to its left? Ambiguity 2: is the paper cutout of a Necker cube seen floating in front of eight black disks painted on a white background (illustrated in figure 3b) or is it seen through eight holes in a white surface, against a black backdrop? (illustrated in figure 3d and rendered in figure 3c) (a) The Bradley & Petry [22] cube. (b) Default interpretation: modal completion. (c) The modified Bradley & Petry [22] cube. (d) Alternative interpretation: amodal completion).
Figure 4.
Figure 4.
Two examples of grouping. (a) Visual. (b) Auditory.
Figure 5.
Figure 5.
Theory of indispensable attributes: the visual thought-experiment. (a) Start. (b) Collapse over wavelength: not indispensable. (c) Collapse over space: indispensable.
Figure 6.
Figure 6.
Theory of indispensable attributes: the auditory thought-experiment. (a) Start. (b) Collapse over space: not indispensable. (c) Collapse over frequency: indispensable.
Figure 7.
Figure 7.
Four-tone bistable tone sequence used by Cook & Van Valkenburg [33]. (a) The middle tones are heard grouped with the high tone, leaving the low tone perceptually isolated. (b) The middle tones are heard grouped with the low tone, leaving the high tone perceptually isolated.
Figure 8.
Figure 8.
Wertheimer's [44] rectangular dimotif dot lattices apply grouping by proximity and grouping by similarity concurrently to the same stimuli. (a) Wertheimer's figure (xii): both proximity and similarity favour columns (in terms defined in figure 9, |b|/|a| =1.083). (b) Wertheimer's figure (xiii): proximity favours columns (in terms defined in figure 9, |b|/|a| = 1.104) but similarity favours rows.
Figure 9.
Figure 9.
Dot lattices. (a) Defining features. (b) Two-dimensional space and nomenclature.
Figure 10.
Figure 10.
The attraction function of grouping by proximity.
Figure 11.
Figure 11.
Two dimotif dot lattices. In both, grouping by proximity favours a, but more weakly in the dot lattice on left, where |a| = 1.25|b|, than in the dot lattice on the right, where |a| = 1.5|b|. In the dot lattice on the left, grouping by similarity favours b3 (δ > 0, where δ is a measure of dissimilarity between two kinds of dots), whereas in the dot lattice on the right it favours a (δ < 0), but because the differences between dot-colours are smaller on the left than on the right, the strength of the grouping by similarity (δ = 2) on the left is smaller than the strength of grouping by similarity on the right (δ =− 3).
Figure 12.
Figure 12.
A schematic of the results obtained by Kubovy & van den Berg [54] using the dimotif lattices described in figure 11, showing that the conjoined effects of proximity and similarity are additive when choice probabilities are represented as log-odds. The line for δ = 0 (light grey) corresponds to the attraction function in figure 10, i.e. all the dots have the same colour, and therefore, grouping by similarity cannot affect the results. The results are equivalent to the multiplicative model of figure 13.
Figure 13.
Figure 13.
Multiplicative model of conjoined grouping that implies additivity of grouping by proximity and grouping by similarity when choice probabilities are represented as log-odds, as in figure 12.
Figure 14.
Figure 14.
An auditory necklace 11100110 of length n = 8; i.e. it is eight beats long. It has five notes and three rests, grouped into two sequences of notes called runs, and two sequences of rests called gaps.
Figure 15.
Figure 15.
The response screen. At the moment depicted, the cross is highlighted.
Figure 16.
Figure 16.
p(rA ) as a function Δ r and Δ g. Error bars span ± 1 s.e.

Similar articles

Cited by

References

    1. McGurk H., MacDonald J. 1976. Hearing lips and seeing voices. Nature 264, 746–74810.1038/264746a0 (doi:10.1038/264746a0) - DOI - DOI - PubMed
    1. Jack C. E., Thurlow W. R. 1973. Effects of degree of visual association and angle of displacement on the ‘ventriloquism’ effect. Percept. Mot. Skills 37, 967–97910.2466/pms.1973.37.3.967 (doi:10.2466/pms.1973.37.3.967) - DOI - DOI - PubMed
    1. Thurlow W. R., Jack C. E. 1973. Certain determinants of the ‘ventriloquism effect’. Percept. Mot. Skills 36, 1171–118410.2466/pms.1973.36.3c.1171 (doi:10.2466/pms.1973.36.3c.1171) - DOI - DOI - PubMed
    1. van Wassenhove V., Grant K. W., Poeppel D. 2007. Temporal window of integration in auditory-visual speech perceptions. Neuropsychologia 45, 598–60710.1016/j.neuropsychologia.2006.01.001 (doi:10.1016/j.neuropsychologia.2006.01.001) - DOI - DOI - PubMed
    1. Bertelson P., Vroomen J., de Gelder B. 2003. Visual recalibration of auditory speech identification: a McGurk aftereffect. Psychol. Sci. 14, 592–59710.1046/j.0956-7976.2003.psci_1470.x (doi:10.1046/j.0956-7976.2003.psci_1470.x) - DOI - DOI - PubMed

LinkOut - more resources