Q-RAN: A constructive reinforcement learning approach for robot behavior learning

Jun Li, Achim Lilienthal, Tomás Martínez-Marín, Tom Duckett

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Scopus citations

Abstract

This paper presents a learning system that uses Q-learning with a resource allocating network (RAN) for behavior learning in mobile robotics. The RAN is used as a function approximator, and Q-learning is used to learn the control policy in 'off-policy' fashion that enables learning to be bootstrapped by a prior knowledge controller, thus speeding up the reinforcement learning. Our approach is verified on a PeopleBot robot executing a visual servoing based docking behavior in which the robot is required to reach a goal pose. Further experiments show that the RAN network can also be used for supervised learning prior to reinforcement learning in a layered architecture, thus further improving the performance of the docking behavior.

Original languageEnglish
Title of host publication2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2006
Pages2656-2662
Number of pages7
DOIs
StatePublished - 2006
Externally publishedYes
Event2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2006 - Beijing, China
Duration: 9 Oct 200615 Oct 2006

Publication series

NameIEEE International Conference on Intelligent Robots and Systems

Conference

Conference2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2006
Country/TerritoryChina
CityBeijing
Period9/10/0615/10/06

Fingerprint

Dive into the research topics of 'Q-RAN: A constructive reinforcement learning approach for robot behavior learning'. Together they form a unique fingerprint.

Cite this