Expanding the V1-MT model to the estimation of perceived fluid direction

Takahiro Kawabe¹

PMID: 40287510
PMCID: PMC12033300
DOI: 10.1038/s41598-025-99069-7

Sci Rep. 2025 Apr 26;15(1):14681.

Affiliation

¹ NTT Communication Science Laboratories, 3-1, Morinosato Wakamiya, Atsugi, Kanagawa, 243-0198, Japan. takahiro.kawabe@ntt.com.

Abstract

Humans can readily perceive the direction of liquid flow, yet computational modeling of this process remains challenging due to the complexity of non-rigid motion. Previous models based on neural activities in the primary visual cortex (V1) and the middle temporal area (MT) have been effective in explaining rigid motion perception. In this study, we extend the V1-MT model to address the perception of liquid flow direction. Participants observed video clips of liquid flow and reported the perceived direction, while the V1-MT model was used to predict these perceptions. The winner-take-all approach failed to accurately capture the observed perceptions. In contrast, a weighted mean of directional energies yielded strong predictions, highlighting that the human visual system spatially integrates directional energies from non-rigid motion components. These findings broaden the applicability of the V1-MT model to non-rigid motion and provide insights into how the visual system bridges the gap between computational models of rigid and non-rigid motion perception.
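
The contrast between the two readouts can be illustrated with a minimal sketch. It assumes a hypothetical population of direction-tuned units with preferred directions `prefs_deg` and non-negative direction energies `energies`; these names and the toy values are illustrative and are not taken from the paper. The weighted mean is computed here as an energy-weighted circular (vector) average, which is one common way to average angles and may differ from the exact formulation used in the study.

```python
import math

def winner_take_all(prefs_deg, energies):
    """Return the preferred direction (deg) of the unit with the largest energy."""
    best = max(range(len(energies)), key=lambda i: energies[i])
    return prefs_deg[best] % 360.0

def weighted_circular_mean(prefs_deg, energies):
    """Energy-weighted vector average of preferred directions, wrapped to [0, 360)."""
    x = sum(e * math.cos(math.radians(p)) for p, e in zip(prefs_deg, energies))
    y = sum(e * math.sin(math.radians(p)) for p, e in zip(prefs_deg, energies))
    if math.hypot(x, y) < 1e-12:
        # Opposing energies cancel, so no single direction is defined.
        raise ValueError("direction undefined: energies cancel out")
    angle = math.degrees(math.atan2(y, x)) % 360.0
    # Guard against floating-point rounding that can yield exactly 360.0.
    return 0.0 if angle >= 360.0 else angle

# Hypothetical toy data: energy is spread over several neighbouring directions,
# as might happen for non-rigid motion, with a marginal peak at 45 deg.
prefs_deg = [0.0, 45.0, 90.0, 135.0, 180.0, 225.0, 270.0, 315.0]
energies = [0.2, 1.0, 0.9, 0.9, 0.2, 0.0, 0.0, 0.0]

print("winner-take-all:", winner_take_all(prefs_deg, energies))
print("weighted mean:  ", weighted_circular_mean(prefs_deg, energies))
```

In this toy case the winner-take-all readout commits to the single strongest unit, whereas the weighted mean is pulled toward the centre of the energy distribution, which is the kind of spatial integration the abstract describes.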

Keywords: Liquid flow direction; Motion; V1-MT model; Weighted average.

Conflict of interest statement

Declarations. Competing interests: Takahiro Kawabe is an employee of Nippon Telegraph and Telephone Corporation. Declaration of generative AI and AI-assisted technologies in the writing process: During the preparation of this work, the author used ChatGPT 4o to improve the English expression of the text. After using this service, the author reviewed and edited the content as needed and takes full responsibility for the content of the publication.

Figures

**Fig. 1**
(a) The processing pipeline of the V1-MT model used in the present study, illustrating the stages from spatiotemporal quadrature filtering to the computation of direction energy. (b) The distribution of direction energy across selective directions of MT neurons for different noise motion directions. (c) A box plot comparing the absolute errors in motion direction estimation between the winner-take-all and weighted averaging models.

**Fig. 2**
(a) Left: Snapshots of stimulus clips. Right: A snapshot of the experimental display showing a video frame of a stimulus clip accompanied by an arrow used by participants to report the perceived direction of liquid flow. (b) The red line represents the mean perceived direction for a stimulus clip across participants, while the gray thin arrows indicate individual perceived directions. The green dashed line and blue dotted line represent the directions inferred by the winner-take-all and weighted averaging models, respectively.

**Fig. 3**
(a) Direction energies summed across MT neurons and fitted with a double von Mises function. The green dashed line and blue dotted line represent the directions inferred by the winner-take-all and weighted averaging models, respectively. (b) Box plots of the absolute errors between participants’ perceived direction and the inferred directions from the winner-take-all (left) and weighted averaging (right) models.

**Fig. 4**
(a) Reported and inferred directions of liquid flow for each clip, plotted as a function of the start frame. Each clip consists of a 5-frame sequence of images. The values in the titles of the graphs indicate the absolute errors between the median reported direction and the inferred direction, as estimated by the weighted averaging (WA) and winner-take-all (WTA) models. (b) Box plots showing the absolute differences between reported and inferred directions for both the WA and WTA models across all clips.
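
The absolute errors shown in Figs. 1, 3 and 4 compare directions, which are circular quantities, so a plain subtraction can overstate the difference across the 0°/360° boundary. A minimal sketch of an angular absolute error follows; the function name `abs_angular_error` and the use of degrees are assumptions for illustration, and the paper may compute the error differently.

```python
def abs_angular_error(reported_deg, inferred_deg):
    """Smallest absolute difference between two directions, in degrees (0 to 180)."""
    diff = (reported_deg - inferred_deg) % 360.0
    return min(diff, 360.0 - diff)

# Directions on either side of the wrap-around point are close, not far apart.
print(abs_angular_error(350.0, 10.0))  # 20.0, not 340.0
```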
