End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
- PMID: 34956359
- PMCID: PMC8702337
- DOI: 10.1155/2021/9945044
Abstract
Developing artificial intelligence (AI) agents that explore efficiently in visually rich and complex environments is challenging. In this study, we formulate exploration as a reinforcement learning problem and rely on intrinsic motivation to guide exploration behavior. This intrinsic motivation is driven by curiosity and is calculated from episode memory. To distribute the intrinsic motivation, we generate it synchronously using a count-based method and temporal distance. We tested our approach in 3D maze-like environments and validated its performance in exploration tasks through extensive experiments. The experimental results show that our agent can learn to explore from raw sensory input and accomplish autonomous exploration across different mazes. In addition, the learned policy is not biased by stochastic objects. We also analyze the effects of different training methods and driving forces on the exploration policy.
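The count-based part of the intrinsic motivation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formula, so the 1/sqrt(N) bonus, the grid discretization of 2D positions, and the names `EpisodicCountBonus` and `discretize` are all assumptions chosen for illustration. The paper works from raw sensory input, and the temporal-distance term is omitted here.

```python
import math
from collections import defaultdict


class EpisodicCountBonus:
    """Count-based novelty bonus over a per-episode memory (hypothetical sketch)."""

    def __init__(self, scale=1.0):
        self.scale = scale
        self.counts = defaultdict(int)  # visit counts for the current episode

    def reset(self):
        """Clear the memory at the start of each episode."""
        self.counts.clear()

    def reward(self, state_key):
        """Record a visit and return a bonus that decays as scale / sqrt(N)."""
        self.counts[state_key] += 1
        return self.scale / math.sqrt(self.counts[state_key])


def discretize(position, cell_size=1.0):
    """Map a continuous (x, y) position to a grid cell (an assumed state key)."""
    x, y = position
    return (math.floor(x / cell_size), math.floor(y / cell_size))


bonus = EpisodicCountBonus()
trajectory = [(0.2, 0.3), (0.4, 0.1), (1.5, 0.2)]
rewards = [bonus.reward(discretize(p)) for p in trajectory]
```

In this sketch the second step revisits the first cell and so earns a smaller bonus, while the third step enters a new cell and earns the full bonus again. Calling `reset()` between episodes keeps the memory episodic.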
Copyright © 2021 Xiaogang Ruan et al.
Conflict of interest statement
The authors declare that they have no conflicts of interest to report regarding the present study.