Multiagent off-screen behavior prediction in football

Shayegan Omidshafiei¹, Daniel Hennes², Marta Garnelo², Zhe Wang², Adria Recasens², Eugene Tarassov², Yi Yang², Romuald Elie², Jerome T Connor², Paul Muller², Natalie Mackraz², Kris Cao², Pol Moreno², Pablo Sprechmann², Demis Hassabis², Ian Graham³, William Spearman³, Nicolas Heess², Karl Tuyls⁴

Affiliations

¹ DeepMind, London, UK. somidshafiei@google.com.
² DeepMind, London, UK.
³ Liverpool Football Club, Liverpool, UK.
⁴ DeepMind, London, UK. karltuyls@deepmind.com.

PMID: 35606400
PMCID: PMC9126960
DOI: 10.1038/s41598-022-12547-0

Multiagent off-screen behavior prediction in football

Shayegan Omidshafiei et al. Sci Rep. 2022.

. 2022 May 23;12(1):8638.

doi: 10.1038/s41598-022-12547-0.

Authors

Affiliations

¹ DeepMind, London, UK. somidshafiei@google.com.
² DeepMind, London, UK.
³ Liverpool Football Club, Liverpool, UK.
⁴ DeepMind, London, UK. karltuyls@deepmind.com.

PMID: 35606400
PMCID: PMC9126960
DOI: 10.1038/s41598-022-12547-0

Abstract

In multiagent worlds, several decision-making individuals interact while adhering to the dynamics constraints imposed by the environment. These interactions, combined with the potential stochasticity of the agents' dynamic behaviors, make such systems complex and interesting to study from a decision-making perspective. Significant research has been conducted on learning models for forward-direction estimation of agent behaviors, for example, pedestrian predictions used for collision-avoidance in self-driving cars. In many settings, only sporadic observations of agents may be available in a given trajectory sequence. In football, subsets of players may come in and out of view of broadcast video footage, while unobserved players continue to interact off-screen. In this paper, we study the problem of multiagent time-series imputation in the context of human football play, where available past and future observations of subsets of agents are used to estimate missing observations for other agents. Our approach, called the Graph Imputer, uses past and future information in combination with graph networks and variational autoencoders to enable learning of a distribution of imputed trajectories. We demonstrate our approach on multiagent settings involving players that are partially-observable, using the Graph Imputer to predict the behaviors of off-screen players. To quantitatively evaluate the approach, we conduct experiments on football matches with ground truth trajectory data, using a camera module to simulate the off-screen player state estimation setting. We subsequently use our approach for downstream football analytics under partial observability using the well-established framework of pitch control, which traditionally relies on fully observed data. We illustrate that our method outperforms several state-of-the-art approaches, including those hand-crafted for football, across all considered metrics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

**Figure 1**
Stylized visualization of the multiagent time-series imputation setting. (a) Agent trajectories up to and including time t. Dark blue indicates trajectory portions that are observed (with light indicating otherwise); the camera field of view at the current time t is indicated in grey. (b) Visualization of masks $m$ for all timesteps, where $m_{t}^{i} = 1$ where dark, and $m_{t}^{i} = 0$ where light. The mask at time t, which corresponds to the frame shown in (a), is highlighted in grey.

**Figure 2**
Graph Imputer model. Our model imputes missing information at each timestep using a combination of bidirectional LSTMs and graph networks. An exposition of a forward-direction update (corresponding to directionalupdate in Algorithm 1 in the “Methods” section) is provided in the left portion of the figure. Dark blue boxes indicate trajectory segments that are observed for each agent (with light blue indicating otherwise). In each direction, agent-specific temporal context is updated via LSTMs with shared parameters. All agents’ LSTM hidden states, ${\vec{h}}_{t - 1}$ , are subsequently used as node features in variational graph networks to ensure information-sharing across agents. This enables learning of a distribution over agent state deviations, $Δ {\vec{x}}_{t}$ . The process is likewise repeated in the backward-direction (right portion of the figure), with the directional updates fused to produce an imputed estimate ${\hat{x}}_{t}$ at each time t. The dotted line indicates that the Graphnet encoder is used only at training time, with the GraphNet prior being used for the final evaluations conducted at test time.

**Figure 3**
Trajectory visualizations (best viewed when zoomed in). Each column provides an example trajectory sequence, with the first row illustrating the ground truth, and subsequent rows showing results from various models, including the Graph Imputer (ours). For all examples, the Graph Imputer trajectories seamlessly adhere to the boundary value constraints imposed at the moments of disappearance and reappearance of players.

**Figure 4**
Pitch control error visualizations. The first column shows the ground truth pitch control field, player positions, and the camera field of view. Each remaining column provides a visualization of the absolute error between pitch control fields based on predicted model outputs and ground truth.

See this image and copyright information in PMC

References

1. Foerster, J. N., Farquhar, G., Afouras, T., Nardelli, N., & Whiteson, S. Counterfactual multi-agent policy gradients. In Proc. Thirty-Second AAAI Conference on Artificial Intelligence, (AAAI-18), the 30th Innovative Applications of Artificial Intelligence (IAAI-18), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI-18), New Orleans, Louisiana, USA, February 2–7, 2018 (eds. McIlraith, S. A. & Weinberger, K. Q.) 2974–2982 (AAAI Press, 2018).
1. Brown, N., Lerer, A., Gross, S. & Sandholm, T. Deep counterfactual regret minimization. In Proc. 36th International Conference on Machine Learning, ICML 2019, 9–15 June 2019, Long Beach, California, USA, Volume 97 of Proc. Machine Learning Research (eds. Chaudhuri, K. & Salakhutdinov, R.) 793–802 (PMLR, 2019).
1. Sun, C., Karlsson, P., Wu, J., Tenenbaum, J. B. & Murphy, K. Predicting the present and future states of multi-agent systems from partially-observed visual data. In International Conference on Learning Representations. https://openreview.net/forum?id=r1xdH3CcKX (2019).
1. Taylor SJ. Modelling Financial Time Series. Number 6578 in World Scientific Books. World Scientific Publishing; 2007.
1. Sezer, O. B., Gudelek, M. U. & Özbayoglu, A. M. Financial time series forecasting with deep learning: A systematic literature review: 2005–2019. CoRR. http://arxiv.org/abs/1911.13288 (2019).

MeSH terms

Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Multiagent off-screen behavior prediction in football

Affiliations

Multiagent off-screen behavior prediction in football

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

MeSH terms

LinkOut - more resources

Full Text Sources

Miscellaneous