We are pleased to announce that our paper entitled “Learning on the Job: Long-Term Behavioural Adaptation in Human-Robot Interactions” by Francesco Del Duchetto and prof. Marc Hanheide has been accepted for publication in the IEEE Robotics and Automation Letters (RA-L) journal. This work originates from the research produced with the long term deployment of our robot Lindsey at the Collection Museum as part of the “Lindsey – A Robot Tour Guide” project.
With this work, we design a learning framework to allow autonomous robots interacting socially with people to adapt their own behaviour online. In particular, we propose this approach with the goal of improving the ability of Lindsey to deliver engaging guided tours to the museum visitors.
The robot is programmed with an initial “static” policy which is then updated over time as the robot explores different states and actions choices. To allow the learning to progress continually, effectively balancing the exploration of new states and actions with the exploitation of the successful ones already seen, we use UCBVI, a Reinforcement Learning (RL) algorithm based on the principle of “optimism in the face of uncertainty”. The learning signal for optimising the policy is the users’ engagement level detected in real-time during the interaction, from the robot head camera, by our learned regression model presented in our previous work “Are You Still With Me?”.
Results show that after a couple of months of exploration, the robot policy learned to maintain the engagement of users for longer, with an increase of 22.8% over the initial static policy in the number of items visited during the tour and a 30% increase in the probability of completing the tour.