TY - GEN
T1 - Q-RAN
T2 - 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2006
AU - Li, Jun
AU - Lilienthal, Achim
AU - Martínez-Marín, Tomás
AU - Duckett, Tom
PY - 2006
Y1 - 2006
N2 - This paper presents a learning system that uses Q-learning with a resource allocating network (RAN) for behavior learning in mobile robotics. The RAN is used as a function approximator, and Q-learning is used to learn the control policy in 'off-policy' fashion that enables learning to be bootstrapped by a prior knowledge controller, thus speeding up the reinforcement learning. Our approach is verified on a PeopleBot robot executing a visual servoing based docking behavior in which the robot is required to reach a goal pose. Further experiments show that the RAN network can also be used for supervised learning prior to reinforcement learning in a layered architecture, thus further improving the performance of the docking behavior.
AB - This paper presents a learning system that uses Q-learning with a resource allocating network (RAN) for behavior learning in mobile robotics. The RAN is used as a function approximator, and Q-learning is used to learn the control policy in 'off-policy' fashion that enables learning to be bootstrapped by a prior knowledge controller, thus speeding up the reinforcement learning. Our approach is verified on a PeopleBot robot executing a visual servoing based docking behavior in which the robot is required to reach a goal pose. Further experiments show that the RAN network can also be used for supervised learning prior to reinforcement learning in a layered architecture, thus further improving the performance of the docking behavior.
UR - http://www.scopus.com/inward/record.url?scp=34250630005&partnerID=8YFLogxK
U2 - 10.1109/IROS.2006.281986
DO - 10.1109/IROS.2006.281986
M3 - Conference contribution
AN - SCOPUS:34250630005
SN - 142440259X
SN - 9781424402595
T3 - IEEE International Conference on Intelligent Robots and Systems
SP - 2656
EP - 2662
BT - 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2006
Y2 - 9 October 2006 through 15 October 2006
ER -