Review

Front Robot AI. 2022 Apr 11:9:799893.

doi: 10.3389/frobt.2022.799893. eCollection 2022.

Robot Learning From Randomized Simulations: A Review

Fabio Muratore¹,², Fabio Ramos³,⁴, Greg Turk⁵, Wenhao Yu⁶, Michael Gienger², Jan Peters¹

Affiliations

¹ Intelligent Autonomous Systems Group, Technical University of Darmstadt, Darmstadt, Germany.
² Honda Research Institute Europe, Offenbach am Main, Germany.
³ School of Computer Science, University of Sydney, Sydney, NSW, Australia.
⁴ NVIDIA, Seattle, WA, United States.
⁵ Georgia Institute of Technology, Atlanta, GA, United States.
⁶ Robotics at Google, Mountain View, CA, United States.

PMID: 35494543
PMCID: PMC9038844
DOI: 10.3389/frobt.2022.799893

Fabio Muratore et al. Front Robot AI. 2022.

Abstract

The rise of deep learning has caused a paradigm shift in robotics research, favoring methods that require large amounts of data. Unfortunately, it is prohibitively expensive to generate such data sets on a physical platform. Therefore, state-of-the-art approaches learn in simulation where data generation is fast as well as inexpensive and subsequently transfer the knowledge to the real robot (sim-to-real). Despite becoming increasingly realistic, all simulators are by construction based on models, hence inevitably imperfect. This raises the question of how simulators can be modified to facilitate learning robot control policies and overcome the mismatch between simulation and reality, often called the "reality gap." We provide a comprehensive review of sim-to-real research for robotics, focusing on a technique named "domain randomization" which is a method for learning from randomized simulations.

Keywords: domain randomization; reality gap; reinforcement learning; robotics; sim-to-real; simulation; simulation optimization bias.

Conflict of interest statement

Author FM was employed by the Technical University of Darmstadt in collaboration with the Honda Research Institute Europe. Author FR was employed by NVIDIA. Author WY was employed by Google. Author MG was employed by the Honda Research Institute Europe. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The authors declare that this study received funding from the Honda Research Institute Europe. The funder had the following involvement in the study: the structuring and improvement of this article jointly with the authors, and the decision to submit it for publication.

Figures

**FIGURE 1**
Examples of sim-to-real robot learning research using domain randomization: (left) Multiple simulation instances of robotic in-hand manipulation (OpenAI et al., 2020), (middle top) transformation to a canonical simulation (James et al., 2019), (middle bottom) synthetic 3D hallways generated for indoor drone flight (Sadeghi and Levine, 2017), (right top) ball-in-a-cup task solved with adaptive dynamics randomization (Muratore et al., 2021a), (right bottom) quadruped locomotion (Tan et al., 2018).

**FIGURE 2**
Topological overview of the sim-to-real research and a selection of related fields.

**FIGURE 3**
Topological overview of domain randomization methods.

**FIGURE 4**
Conceptual illustration of static domain randomization.

**FIGURE 5**
Conceptual illustration of adaptive domain randomization.

**FIGURE 6**
Conceptual illustration of adversarial domain randomization.
