Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Apr 13;20(7):649-56.
doi: 10.1016/j.cub.2010.02.032. Epub 2010 Mar 18.

Human-monkey gaze correlations reveal convergent and divergent patterns of movie viewing

Affiliations

Human-monkey gaze correlations reveal convergent and divergent patterns of movie viewing

Stephen V Shepherd et al. Curr Biol. .

Abstract

The neuroanatomical organization of the visual system is largely similar across primate species, predicting similar visual behaviors and perceptions. Although responses to trial-by-trial presentation of static images suggest that primates share visual orienting strategies, these reduced stimuli fail to capture key elements of the naturalistic, dynamic visual world in which we evolved. Here, we compared the gaze behavior of humans and macaques when they viewed three different 3-minute movie clips. We found significant intersubject and interspecies gaze correlations, suggesting that both species attend a common set of events in each scene. Comparing human and monkey gaze behavior with a computational saliency model revealed that interspecies gaze correlations were driven by biologically relevant social stimuli overlooked by low-level saliency models. Additionally, humans, but not monkeys, tended to gaze toward the targets of viewed individual's actions or gaze. Together, these data suggest that human and monkey gaze behavior comprises converging and diverging informational strategies, driven by both scene content and context; they are not fully described by simple low-level visual models.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Video scenes with superimposed gaze traces
Example frames from the Life of Mammals (A), the Jungle Book (B), and City Lights (C), shown with superimposed gaze coordinates (monkeys green, humans blue).
Figure 2
Figure 2. Correlation in gaze location across scanpaths
Correlations are evident across humans, monkeys, and between species. Here, scanpaths are split into vertical and horizontal coordinates, and plotted for (A) two humans paths viewing City Lights (local correlation r=0.66), (B) a human and a monkey viewing the Jungle Book (local correlation r=0.08), and (C) one monkey during repeated viewings of the Life of Mammals (local correlation r=0.36). The local correlations are typical of the inter-human, inter-species, and intra-monkey scanpaths, respectively. Significant correlations existed between primate scanpaths produced in response to the same video clip. Both spatial position (D) and eye movement speed (E) were correlated across primate scanpaths, whether produced by the same individual or a different individual and whether produced by a human (blue) or a monkey (green). Gray bars indicate the permutation baseline for α = 0.05 (thick) or 0.001 (thin): all primate gaze ISCs were significant with α < .001. Artificial scanpaths produced by a low-level visual saliency model (orange) were significantly correlated with primate scanpaths in spatial position (α < 0.001) and gaze shift timing (α < 0.05); however, residual inter-primate correlations (yellow) were essentially unchanged despite partialling out shared similarities to artificial scanpaths. Finally, to confirm that behavioral correlations were driven by visual fixation priorities, we compared the percentage of gaze samples which overlapped (±3.5°) across different scanpaths (F): The pattern of results was identical to the pattern observed for inter-subject correlation. Furthermore, we found that samples that overlapped between humans and monkeys rarely overlapped with the best-performing artificial scanpath (2 of 31%, see inset), These data rule out the hypothesis that gaze correlations were driven primarily by low-level visual features, at least as characterized by well-established neuromorphic computational saliency models [27, 29].
Figure 3
Figure 3. Comparison of gaze reliability across time, by species
The standard deviation of simultaneously-recorded gaze coordinates from humans (A), monkeys (B) and both species (C) can, in conjunction, differentiate the visual strategies of monkeys or humans. At right we show frames during a clip from the Life of Mammals that most dispersed attention (D, a long shot featuring a number of monkeys), that sometimes gathered attention (E, social stimuli captured sustained human interest, while newly-visible scenery during a pan was quickly surveyed by monkeys). Faces often captured the attention of all primates (F), while dyadic social interactions sometimes produced separate gaze clusters for humans and macaques (G).
Figure 4
Figure 4. Higher-level social cues contribute to observed gaze correlations
We selected video frames in which observed gaze was highly clustered at a location far from that chosen by the low-level saliency model, examined scene content at the human, monkey, and simulated gaze coordinates, and tallied the percentage of frames in which scene content comprised biologically-relevant stimuli. Relative to image regions selected by artificial scanpaths, regions that were consistently viewed by humans and monkeys often featured social agents and the targets of their actions or attentions. Humans and monkeys fixated bodies and faces significantly more than low-level simulations, especially the eye and (for humans) mouth regions. Humans, but not monkeys, fixated socially-cued regions – for example, things being reached or gazed toward – significantly more often than predicted by the model. This effect was not mediated just by hand motion, which attracted gaze significantly more often for the simulation than for actual humans or monkeys.

References

    1. Kaas JH, Preuss TM. Archontan affinities as reflected in the visual system. In: Szalay FS, Novacek MJ, McKenna MC, editors. Mammalian phylogeny. New York: Springer-Verlag; 1993. pp. 115–128.
    1. Krubitzer LA, Kaas JH. Cortical Connections of Mt in 4 Species of Primates - Areal, Modular, and Retinotopic Patterns. Visual Neuroscience. 1990;5:165–204. - PubMed
    1. Ghazanfar AA, Nielsen K, Logothetis NK. Eye movements of monkeys viewing vocalizing conspecifics. Cognition. 2006;101:515–529. - PubMed
    1. Gothard KM, Brooks KN, Peterson MA. Multiple perceptual strategies used by macaque monkeys for face recognition. Animal Cognition. 2009;12:155–167. - PubMed
    1. Gothard KM, Erickson CA, Amaral DG. How do rhesus monkeys (Macaca mulatta) scan faces in a visual paired comparison task? Animal Cognition. 2004;7:25–36. - PubMed

Publication types

LinkOut - more resources