Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 26;15(1):14681.
doi: 10.1038/s41598-025-99069-7.

Expanding the V1-MT model to the estimation of perceived fluid direction

Affiliations

Expanding the V1-MT model to the estimation of perceived fluid direction

Takahiro Kawabe. Sci Rep. .

Abstract

Humans can readily perceive the direction of liquid flow, yet computational modeling of this process remains challenging due to the complexity of non-rigid motion. Previous models based on neural activities in the primary visual cortex (V1) and the middle temporal area (MT) have been effective in explaining rigid motion perception. In this study, we extend the V1-MT model to address the perception of liquid flow direction. Participants observed video clips of liquid flow and reported the perceived direction, while the V1-MT model was used to predict these perceptions. The winner-take-all approach failed to accurately capture the observed perceptions. In contrast, a weighted mean of directional energies yielded strong predictions, highlighting that the human visual system spatially integrates directional energies from non-rigid motion components. These findings broaden the applicability of the V1-MT model to non-rigid motion and provide insights into how the visual system bridges the gap between computational models of rigid and non-rigid motion perception.

Keywords: Liquid flow direction; Motion; V1-MT model; Weighted average.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: Takahiro Kawabe is an employee of Nippon Telegraph and Telephone Corporation. Declaration of generative AI and AI-assisted technologies in the writing process: During the preparation of this work the author used ChatGPT 4o in order to improve English expression of text. After using this service, the author reviewed and edited the content as needed and take(s) full responsibility for the content of the publication.

Figures

Fig. 1
Fig. 1
(a) The processing pipeline of the V1-MT model used in the present study, illustrating the stages from spatiotemporal quadrature filtering to the computation of direction energy. (b) The distribution of direction energy across selective directions of MT neurons for different noise motion directions. (c) A box plot comparing the absolute errors in motion direction estimation between the winner-take-all and weighted averaging models.
Fig. 2
Fig. 2
(a) Left: Snapshots of stimulus clips. Right: A snapshot of the experimental display showing a video frame of a stimulus clip accompanied by an arrow used by participants to report the perceived direction of liquid flow. (b) The red line represents the mean perceived direction for a stimulus clip across participants, while the gray thin arrows indicate individual perceived directions. The green dashed line and blue dotted line represent the directions inferred by the winner-take-all and weighted averaging models, respectively.
Fig. 3
Fig. 3
(a) Direction energies summed across MT neurons and fitted with a double von Mises function. The green dashed line and blue dotted line represent the directions inferred by the winner-take-all and weighted averaging models, respectively. (b) Box plots of the absolute errors between participants’ perceived direction and the inferred directions from the winner-take-all (left) and weighted averaging (right) models.
Fig. 4
Fig. 4
(a) Reported and inferred directions of liquid flow for each clip, plotted as a function of the start frame. Each clip consists of a 5-frame sequence of images. The values in the titles of the graphs indicate the absolute errors between the median reported direction and the inferred direction, as estimated by the weighted averaging (WA) and winner-take-all (WTA) models. (b) Box plots showing the absolute differences between reported and inferred directions for both the WA and WTA models across all clips.

Similar articles

References

    1. Kawabe, T., Maruya, K., Fleming, R. W. & Nishida, S. Seeing liquids from visual motion. Vis. Res109, 125–138 (2015). - PubMed
    1. Paulun, V. C., Kawabe, T., Nishida, S. & Fleming, R. W. Seeing liquids from static snapshots. Vis. Res115, 163–174 (2015). - PubMed
    1. van Assen, J. J. R., Barla, P. & Fleming, R. W. Visual features in the perception of liquids. Curr. Biol.28, 452-458.e4 (2018). - PMC - PubMed
    1. Kawabe, T., Maruya, K. & Nishida, S. Perceptual transparency from image deformation. Proc. Natl. Acad. Sci.112, E4620–E4627 (2015). - PMC - PubMed
    1. Kawabe, T. Linear motion coverage as a determinant of transparent liquid perception. Percept.9, 2041669518813375 (2018). - PMC - PubMed

LinkOut - more resources