Meta attention for Off-Policy Actor-Critic

Jiateng Huang¹, Wanrong Huang², Long Lan², Dan Wu³

Affiliations

¹ National University of Defense Technology, College of Computer Science and Technology, Institute for Quantum Information & State Key Laboratory of High Performance Computing, Changsha, 410073, Hunan, China. Electronic address: Jiateng_Huang@outlook.com.
² National University of Defense Technology, College of Computer Science and Technology, Institute for Quantum Information & State Key Laboratory of High Performance Computing, Changsha, 410073, Hunan, China.
³ National University of Defense Technology, College of Computer & Hefei Interdisciplinary Center, Changsha, 410073, Hunan, China.

PMID: 37030278
DOI: 10.1016/j.neunet.2023.03.024

Meta attention for Off-Policy Actor-Critic

Jiateng Huang et al. Neural Netw. 2023 Jun.

. 2023 Jun:163:86-96.

doi: 10.1016/j.neunet.2023.03.024. Epub 2023 Mar 28.

Authors

Jiateng Huang¹, Wanrong Huang², Long Lan², Dan Wu³

Affiliations

¹ National University of Defense Technology, College of Computer Science and Technology, Institute for Quantum Information & State Key Laboratory of High Performance Computing, Changsha, 410073, Hunan, China. Electronic address: Jiateng_Huang@outlook.com.
² National University of Defense Technology, College of Computer Science and Technology, Institute for Quantum Information & State Key Laboratory of High Performance Computing, Changsha, 410073, Hunan, China.
³ National University of Defense Technology, College of Computer & Hefei Interdisciplinary Center, Changsha, 410073, Hunan, China.

PMID: 37030278
DOI: 10.1016/j.neunet.2023.03.024

Abstract

Off-Policy Actor-Critic methods can effectively exploit past experiences and thus they have achieved great success in various reinforcement learning tasks. In many image-based and multi-agent tasks, attention mechanism has been employed in Actor-Critic methods to improve their sampling efficiency. In this paper, we propose a meta attention method for state-based reinforcement learning tasks, which combines attention mechanism and meta-learning based on the Off-Policy Actor-Critic framework. Unlike previous attention-based work, our meta attention method introduces attention in the Actor and the Critic of the typical Actor-Critic framework, rather than in multiple pixels of an image or multiple information sources in specific image-based control tasks or multi-agent systems. In contrast to existing meta-learning methods, the proposed meta-attention approach is able to function in both the gradient-based training phase and the agent's decision-making process. The experimental results demonstrate the superiority of our meta-attention method in various continuous control tasks, which are based on the Off-Policy Actor-Critic methods including DDPG and TD3.

Keywords: Actor-Critic methods; Attention mechanism; Meta learning; Reinforcement learning.

PubMed Disclaimer

MeSH terms

Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Elsevier Science
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Meta attention for Off-Policy Actor-Critic

Affiliations

Meta attention for Off-Policy Actor-Critic

Authors

Affiliations

Abstract

MeSH terms

LinkOut - more resources

Full Text Sources

Miscellaneous