Robot Learning From Randomized Simulations: A Review
- PMID: 35494543
- PMCID: PMC9038844
- DOI: 10.3389/frobt.2022.799893
Robot Learning From Randomized Simulations: A Review
Abstract
The rise of deep learning has caused a paradigm shift in robotics research, favoring methods that require large amounts of data. Unfortunately, it is prohibitively expensive to generate such data sets on a physical platform. Therefore, state-of-the-art approaches learn in simulation where data generation is fast as well as inexpensive and subsequently transfer the knowledge to the real robot (sim-to-real). Despite becoming increasingly realistic, all simulators are by construction based on models, hence inevitably imperfect. This raises the question of how simulators can be modified to facilitate learning robot control policies and overcome the mismatch between simulation and reality, often called the "reality gap." We provide a comprehensive review of sim-to-real research for robotics, focusing on a technique named "domain randomization" which is a method for learning from randomized simulations.
Keywords: domain randomization; reality gap; reinforcement learning; robotics; sim-to-real; simulation; simulation optimization bias.
Copyright © 2022 Muratore, Ramos, Turk, Yu, Gienger and Peters.
Conflict of interest statement
Author FM was employed by the Technical University of Darmstadt in collaboration with the Honda Research Institute Europe. Author FR was employed by NVIDIA. Author WY was employed by Google. Author MG was employed by the Honda Research Institute Europe. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The authors declare that this study received funding from the Honda Research Institute Europe. The funder had the following involvement in the study: the structuring and improvement of this article jointly with the authors, and the decision to submit it for publication.
Figures






References
-
- Abdulsamad H., Dorau T., Belousov B., Zhu J., Peters J. (2021). Distributionally Robust Trajectory Optimization under Uncertain Dynamics via Relative-Entropy Trust Regions. arXiv 2103.15388
-
- Alghonaim R., Johns E. (2020). Benchmarking Domain Randomisation for Visual Sim-To-Real Transfer. arXiv 2011.07112
-
- Allevato A., Short E. S., Pryor M., Thomaz A. (2019). Tunenet: One-Shot Residual Tuning for System Identification and Sim-To-Real Robot Task Transfer. In Conference on Robot Learning (CoRL), Osaka, Japan, October 30 - November 1 (PMLR; ), vol. 100 of Proc. Machine Learn. Res., 445–455.
-
- Andrychowicz M., Crow D., Ray A., Schneider J., Fong R., Welinder P., et al. (2017). “Hindsight Experience Replay,” in Conference on Neural Information Processing Systems (NIPS), December 4-9 (Long Beach, CA, USA, 5048–5058.
Publication types
LinkOut - more resources
Full Text Sources
Miscellaneous