Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 21:9:717193.
doi: 10.3389/frobt.2022.717193. eCollection 2022.

Affect-Driven Learning of Robot Behaviour for Collaborative Human-Robot Interactions

Affiliations

Affect-Driven Learning of Robot Behaviour for Collaborative Human-Robot Interactions

Nikhil Churamani et al. Front Robot AI. .

Abstract

Collaborative interactions require social robots to share the users' perspective on the interactions and adapt to the dynamics of their affective behaviour. Yet, current approaches for affective behaviour generation in robots focus on instantaneous perception to generate a one-to-one mapping between observed human expressions and static robot actions. In this paper, we propose a novel framework for affect-driven behaviour generation in social robots. The framework consists of (i) a hybrid neural model for evaluating facial expressions and speech of the users, forming intrinsic affective representations in the robot, (ii) an Affective Core, that employs self-organising neural models to embed behavioural traits like patience and emotional actuation that modulate the robot's affective appraisal, and (iii) a Reinforcement Learning model that uses the robot's appraisal to learn interaction behaviour. We investigate the effect of modelling different affective core dispositions on the affective appraisal and use this affective appraisal as the motivation to generate robot behaviours. For evaluation, we conduct a user study (n = 31) where the NICO robot acts as a proposer in the Ultimatum Game. The effect of the robot's affective core on its negotiation strategy is witnessed by participants, who rank a patient robot with high emotional actuation higher on persistence, while an impatient robot with low emotional actuation is rated higher on its generosity and altruistic behaviour.

Keywords: core affect; human-robot interaction; multi-modal affect perception; neural networks; reinforcement learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The handling Editor is currently co-organizing a Research Topic with one of the authors PB and declared a past collaboration with one of the authors HG.

Figures

FIGURE 1
FIGURE 1
Proposed Framework: Multi-Modal Affect Perception of the robot combines facial and auditory features using the MCCNN. The Perception-GWR creates feature prototypes with the BMUs encoding arousal-valence while repeated interactions form the Affective Memory. The Affective Core models affective dispositions in the robot, resulting from its social conditioning and time perception. Current perception, affective memory and the affective core influence Mood formation which is used by the Behaviour Generation model to learn negotiating behaviour in the Ultimatum Game.
FIGURE 2
FIGURE 2
Patient (A) and Impatient (B) Mood Modulation results in Affective Cores encoding corresponding Patient (C) and Impatient (D) Time Perception for the agent.
FIGURE 3
FIGURE 3
Affective Cores encoding Excitatory or High-arousal (A) and Inhibitory or Low-arousal (B) modulation to form the Social Conditioning of the agent.
FIGURE 4
FIGURE 4
Participant and the NICO robot negotiating in the Ultimatum Game scenario.
FIGURE 5
FIGURE 5
Actor-Critic model for Learning Robot behaviour.
FIGURE 6
FIGURE 6
Arousal and Valence distributions for different Affective Core biases. All distributions are compared to the No Core condition to measure significant differences due to the Affective Core of the robot. A * denotes a significant variation compared to the No Core condition.
FIGURE 7
FIGURE 7
Robot learning to negotiate, converging an offer >45% of the resources in 10 interactions.
FIGURE 8
FIGURE 8
Asch’s Test results with mean and 95% CI for individual dimensions comparing the two measured conditions. A * denotes a significant difference between the two conditions.

Similar articles

Cited by

References

    1. Ahn H., Picard R. W. (2005). “Affective-cognitive Learning and Decision Making: A Motivational Reward Framework for Affective Agents,” in Affective Computing and Intelligent Interaction. Editors Tao J., Tan T., Picard R. (Berlin: Springer Berlin Heidelberg; ), 866–873. 10.1007/11573548_111 - DOI
    1. Asch S. E. (1946). Forming Impressions of Personality. J. Abnormal Soc. Psychol. 41, 258–290. 10.1037/h0055756 - DOI - PubMed
    1. Bandyopadhyay D., Pammi V. S. C., Srinivasan N. (2013). Role of Affect in Decision Making. Prog. Brain Res. 202, 37–53. 10.1016/B978-0-444-62604-2.00003-4 - DOI - PubMed
    1. Barros P., Barakova E., Wermter S. (2020). Adapting the Interplay between Personalized and Generalized Affect Recognition Based on an Unsupervised Neural Framework. IEEE Trans. Affective Comput. 10.1109/taffc.2020.3002657 - DOI
    1. Barros P., Churamani N., Lakomkin E., Sequeira H., Sutherland A., Wermter S. (2018). “The OMG-Emotion Behavior Dataset,” in International Joint Conference on Neural Networks (IJCNN) (Rio de Janeiro, Brazil: IEEE; ), 1408–1414. 10.1109/ijcnn.2018.8489099 - DOI

LinkOut - more resources