Abstract
Smooth control of an active vision head's verge-axis joint is performed through continuous state and action reinforcement learning. The system learns to perform visual servoing from rewards given relative to tracking performance. The learned controller compensates for the velocity of the target and performs lag-free pursuit of a swinging target. By comparing controllers exposed to different environments, we show that the controller predicts the target's motion by forming an implicit model of it. Experimental results are presented that demonstrate the advantages and disadvantages of implicit modelling.
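
The abstract gives no implementation details, so the following is only a minimal sketch of the kind of continuous state/action learning loop it describes: a simulated 1-DOF verge joint tracks a swinging (sinusoidal) target, the reward penalizes retinal tracking error, and a linear-Gaussian policy outputs a continuous joint-velocity command, trained with a one-step actor-critic update. Every name, gain, and the choice of actor-critic are illustrative assumptions, not the authors' method.

```python
# Minimal sketch: continuous state/action actor-critic visual servoing on a
# simulated 1-DOF verge axis pursuing a swinging (sinusoidal) target.
# All parameters and the linear-Gaussian actor-critic are assumptions.
import numpy as np

rng = np.random.default_rng(0)
dt = 0.02                       # control period [s]
A, omega = 0.3, 1.5             # swing amplitude [rad] and frequency [rad/s]
sigma = 0.1                     # exploration noise on the velocity command
alpha_actor, alpha_critic, gamma = 5e-4, 5e-3, 0.95

w = np.zeros(3)                 # actor weights:  a = w . phi(s) + noise
v = np.zeros(3)                 # critic weights: V(s) = v . phi(s)

def features(err, derr):
    """Linear features over the tracking error and its derivative."""
    return np.array([err, derr, 1.0])

def run_episode(T=1500):
    global w, v
    theta_c = 0.0               # verge joint angle [rad]
    prev_err = 0.0
    total_reward = 0.0
    for k in range(1, T + 1):
        t = k * dt
        err = A * np.sin(omega * t) - theta_c          # retinal tracking error
        derr = (err - prev_err) / dt
        s = features(err, derr)

        a = w @ s + sigma * rng.standard_normal()      # continuous velocity command
        theta_c += a * dt                              # verge joint integrates velocity

        next_err = A * np.sin(omega * (t + dt)) - theta_c
        r = -abs(next_err)                             # reward: small tracking error
        s_next = features(next_err, (next_err - err) / dt)

        # One-step TD error drives both the critic and the actor updates.
        delta = r + gamma * (v @ s_next) - (v @ s)
        v += alpha_critic * delta * s
        w += alpha_actor * delta * (a - w @ s) / sigma**2 * s   # Gaussian policy gradient

        prev_err = err
        total_reward += r
    return total_reward

for ep in range(20):
    print(f"episode {ep:2d}  return {run_episode():8.2f}")
```

Because the state includes the error derivative, a learned linear gain on that feature acts as a velocity-compensating (predictive) term, which is one simple way a controller could achieve the lag-free pursuit behaviour the abstract reports.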
| Field | Value |
|---|---|
| Original language | English |
| Pages (from-to) | 4122-4129 |
| Number of pages | 8 |
| Journal | Proceedings - IEEE International Conference on Robotics and Automation |
| Volume | 3 |
| State | Published - 2003 |
| Externally published | Yes |
| Event | 2003 IEEE International Conference on Robotics and Automation - Taipei, Taiwan, Province of China |
| Event duration | 14 Sep 2003 → 19 Sep 2003 |