Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Aug 3;18(8):2539.
doi: 10.3390/s18082539.

Efficient Force Control Learning System for Industrial Robots Based on Variable Impedance Control

Affiliations

Efficient Force Control Learning System for Industrial Robots Based on Variable Impedance Control

Chao Li et al. Sensors (Basel). .

Abstract

Learning variable impedance control is a powerful method to improve the performance of force control. However, current methods typically require too many interactions to achieve good performance. Data-inefficiency has limited these methods to learn force-sensitive tasks in real systems. In order to improve the sampling efficiency and decrease the required interactions during the learning process, this paper develops a data-efficient learning variable impedance control method that enables the industrial robots automatically learn to control the contact force in the unstructured environment. To this end, a Gaussian process model is learned as a faithful proxy of the system, which is then used to predict long-term state evolution for internal simulation, allowing for efficient strategy updates. The effects of model bias are reduced effectively by incorporating model uncertainty into long-term planning. Then the impedance profiles are regulated online according to the learned humanlike impedance strategy. In this way, the flexibility and adaptivity of the system could be enhanced. Both simulated and experimental tests have been performed on an industrial manipulator to verify the performance of the proposed method.

Keywords: Gaussian processes; efficient learning; force control; industrial robot; variable impedance control.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The interaction model of the system. (a) Without any contact between the robot and the environment; (b) critical point when contact occurs; (c) stable contact with the environment; (d) contact force diagram when robot comes in contact with the environment.
Figure 2
Figure 2
Contact force and moment applied at the end-effector.
Figure 3
Figure 3
An implementation example of the contact force observer.
Figure 4
Figure 4
The position-based impedance control schematic.
Figure 5
Figure 5
Scheme of the data-efficient learning variable impedance control.
Figure 6
Figure 6
A conceptual illustration of long-term predictions of state evolution.
Figure 7
Figure 7
The block diagram of simulation in MATLAB Simulink.
Figure 8
Figure 8
Simulation results of force control learning system. (a) The cost curve of learning process; the blue dotted line and the blue shade are the predicted cost mean and the 95% confidence interval. (b) The performances of force control during learning process.
Figure 9
Figure 9
Learning process of the total 20 learning iterations. The first row: contact force in z-axis direction; the second row: Cartesian stiffness schedules; the third row: Cartesian damping schedules; the fourth row: position of the end-effector in z-axis direction; the fifth row: velocity of the end-effector in z-axis direction.
Figure 10
Figure 10
Joint trajectories after 4, 5, 10, 15, and 20 updates for the second, third, and fifth joint of the Reinovo robot.
Figure 11
Figure 11
States evolution of force control. Columns (ad) are the state evolutions of the 1st, 6th, 12th, and 20th learning iteration, respectively. The top row is the change of contact force Fz, the second row is the target stiffness Kdz, and the third row is the target damping Bdz.
Figure 12
Figure 12
Hardware architecture of the system.
Figure 13
Figure 13
Implementation diagram of the algorithm.
Figure 14
Figure 14
(a) Experimental setup; (b) simplified model of the contact environment.
Figure 15
Figure 15
Experimental results of force control learning system. (a) The cost curve of learning process. (b) The performances of force control during learning process, including a total 20 learning iterations throughout the experiment.
Figure 16
Figure 16
Main iterations of the learning process. (a) Contact force; (b) Cartesian stiffness schedules; (c) Cartesian damping schedules.
Figure 17
Figure 17
States evolution of force control. Columns (ad) are the state evolutions of the 1st, 6th, 12th, and 20th learning iteration, respectively. The top row shows the contact force Fz, the second row shows the profile of stiffness Kdz, and the third row shows the profile of damping Bdz.
Figure 18
Figure 18
Trajectories during the 20th experiment iteration. (a) Joint position; (b) Joint velocity; (c) Cartesian position of the end-effector; (d) Cartesian velocity of the end-effector.
Figure 19
Figure 19
Experimental results of environmental adaptability. (a) Cost curve; (b) contact force; (c) Cartesian stiffness schedules; (d) Cartesian damping schedules.
Figure 20
Figure 20
Experimental comparison results. (a) Force control performance; (b) target stiffness; (c) target damping.
Figure 21
Figure 21
(a) The cost curve of the learning process. (b) Comparison of learning speed with other learning variable impedance control methods.
Figure 22
Figure 22
Computational time for each rollout.

References

    1. Siciliano B., Sciavicco L., Villani L., Oriolo G. Robotics: Modelling, planning and control. Adv. Textb. Control Signal Process. 2009;4:76–82.
    1. Hogan N. Impedance control: An approach to manipulation: Part I—Theory. J. Dyn. Syst. Meas. Control. 1985;107:1–7. doi: 10.1115/1.3140702. - DOI
    1. Burdet E., Osu R., Franklin D.W., Milner T.E., Kawato M. The central nervous system stabilizes unstable dynamics by learning optimal impedance. Nature. 2001;414:446–449. doi: 10.1038/35106566. - DOI - PubMed
    1. Kieboom J.V.D., Ijspeert A.J. Exploiting natural dynamics in biped locomotion using variable impedance control; Proceedings of the 13th IEEE-RAS International Conference on Humanoid Robots; Atlanta, GA, USA. 15–17 October 2013; pp. 348–353. - DOI
    1. Buchli J., Stulp F., Theodorou E., Schaal S. Learning variable impedance control. Int. J. Robot. Res. 2011;30:820–833. doi: 10.1177/0278364911402527. - DOI

LinkOut - more resources