Learning implicit models during target pursuit

Chris Gaskett, Peter Brown, Gordon Cheng, Alexander Zelinsky

Research output: Contribution to journalConference articlepeer-review

2 Scopus citations

Abstract

Smooth control using an active vision head's verge-axis joint is performed through continuous state and action reinforcement learning. The system learns to perform visual servoing based on rewards given relative to tracking performance. The learned controller compensates for the velocity of the target and performs lag-free pursuit of a swinging target. By comparing controllers exposed to different environments we show that the controller is predicting the motion of the target by forming an implicit model of the target's motion. Experimental results are presented that demonstrate the advantages and disadvantages of implicit modelling.

Original languageEnglish
Pages (from-to)4122-4129
Number of pages8
JournalProceedings - IEEE International Conference on Robotics and Automation
Volume3
StatePublished - 2003
Externally publishedYes
Event2003 IEEE International Conference on Robotics and Automation - Taipei, Taiwan, Province of China
Duration: 14 Sep 200319 Sep 2003

Fingerprint

Dive into the research topics of 'Learning implicit models during target pursuit'. Together they form a unique fingerprint.

Cite this