Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 14:16:826410.
doi: 10.3389/fnbot.2022.826410. eCollection 2022.

A Bio-Inspired Mechanism for Learning Robot Motion From Mirrored Human Demonstrations

Affiliations

A Bio-Inspired Mechanism for Learning Robot Motion From Mirrored Human Demonstrations

Omar Zahra et al. Front Neurorobot. .

Abstract

Different learning modes and mechanisms allow faster and better acquisition of skills as widely studied in humans and many animals. Specific neurons, called mirror neurons, are activated in the same way whether an action is performed or simply observed. This suggests that observing others performing movements allows to reinforce our motor abilities. This implies the presence of a biological mechanism that allows creating models of others' movements and linking them to the self-model for achieving mirroring. Inspired by such ability, we propose to build a map of movements executed by a teaching agent and mirror the agent's state to the robot's configuration space. Hence, in this study, a neural network is proposed to integrate a motor cortex-like differential map transforming motor plans from task-space to joint-space motor commands and a static map correlating joint-spaces of the robot and a teaching agent. The differential map is developed based on spiking neural networks while the static map is built as a self-organizing map. The developed neural network allows the robot to mirror the actions performed by a human teaching agent to its own joint-space and the reaching skill is refined by the complementary examples provided. Hence, experiments are conducted to quantify the improvement achieved thanks to the proposed learning approach and control scheme.

Keywords: imitation learning; robotics; sensor-based control; spiking neural networks; visual servoing.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
The robotic manipulator and human arm sharing the same end effector position and jointly moving during motor babbling to provide a proper training data.
Figure 2
Figure 2
Schematic diagrams for motor cortex-like map (MCM).
Figure 3
Figure 3
(A) The symmetric spike-timing-dependent plasticity (STDP) learning rule at various values for τa and τb. (B) A plot of change in synaptic weight vs. training iterations.
Figure 4
Figure 4
(A) A three link planar arm employed for numerical simulation. The robot deviates from a target path ϕ while moving from the current pose to the target pose to move through ρ instead. (B) The data collection schemes are illustrated. Motor babbling (in the upper panel) commands linear motion in joint space under the guidance of a differential forward kinematic DFK solver. Target path follower (in the lower panel) under the guidance of a differential inverse kinematic DIK solver.
Figure 5
Figure 5
Schematic diagrams for SOMs connected through Oja-Hebbian plastic synapses. This architecture allows correlating the joint spaces of the human arm and robot arm. During the training phase, BMUs (in AJ-SOM and RJ-SOM) from both maps that fire together are more likely to have an increase in strength of the connecting synapses. Consequently, during the control phase, if the same BMU in AJ-SOM becomes active, the corresponding node in RJ-SOM becomes active as well.
Figure 6
Figure 6
A schematic diagram of the correspondence of the human and robot joint spaces along with the task space TS. The data collected from the human and robot together allows building correlation (i.e., f(qh) between JSH and JSR). This allows generating more examples to train the map g(qr, v) correlating TS to JSR by transforming examples conducted by the human arm (in TS) into JSR.
Figure 7
Figure 7
A schematic diagram of the full system including both the static (SOMs network) and differential(MCM network) maps formed. Blocks with an inclined arrow passing through are the ones where learning/adaptation occurs during each phase.
Figure 8
Figure 8
(A) The human arm while moving in straight paths and (B) the plot of the Cartesian position of the hand-held object while moving.
Figure 9
Figure 9
Heatmaps of the standard Kohonen SOM depicting the relation between each input from the joint spaces of (A) the human arm and (B) robot agents.
Figure 10
Figure 10
Heatmaps of the varying density SOM depicting the relation between each input from the joint spaces of (A) the human arm and (B) robot agents.
Figure 11
Figure 11
(A) The robot arm while moving after training by the transformed data and (B) the plot of the Cartesian position of the end effector while moving.
Figure 12
Figure 12
Values of the fitness function vs. the number of optimization iterations.

Similar articles

Cited by

References

    1. Amari S., et al. . (2003). The Handbook of Brain Theory and Neural Networks. MIT Press.
    1. Argall B. D., Chernova S., Veloso M., Browning B. (2009). A survey of robot learning from demonstration. Robot. Auton. Syst. 57, 469–483. 10.1016/j.robot.2008.10.024 - DOI
    1. Arsenault B. (2018). Adaptive tree parzen estimator. Available online at: https://github.com/electricbrainio
    1. Bergstra J., Bardenet R., Bengio Y., Kégl B. (2011). Algorithms for hyper-parameter optimization, in 25th Annual Conference on Neural Information Processing Systems (NIPS 2011), vol. 24 (Red Hook, NY: Neural Information Processing Systems Foundation; ).
    1. Cook R., Bird G., Catmur C., Press C., Heyes C. (2014). Mirror neurons: from origin to function. Behav. Brain Sci. 37, 177–192. 10.1017/S0140525X13000903 - DOI - PubMed

LinkOut - more resources