Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression

Jun Morimoto, Christopher G. Atkeson, Gen Endo, Gordon Cheng

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

22 Scopus citations

Abstract

We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although RL is a useful non-linear optimizer, it is usually difficult to apply RL to real robotic systems - due to the large number of iterations required to acquire suitable policies. In this study, we first approximated the dynamics by using data from a real robot, and then applied the estimated dynamics in RL in order to improve stepping and walking policies. Gaussian processes were used to approximate the dynamics. By using Gaussian processes, we could estimate a probability distribution of a target function with a given covariance function. Thus, RL can take the uncertainty of the approximated dynamics into account throughout the learning process. We show that we can improve stepping and walking policies by using a RL method with the approximated models both in simulated and real environments. Experimental validation on a real humanoid robot of the proposed learning approach is presented.

Original languageEnglish
Title of host publicationProceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007
Pages4234-4240
Number of pages7
DOIs
StatePublished - 2007
Externally publishedYes
Event2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007 - San Diego, CA, United States
Duration: 29 Oct 20072 Nov 2007

Publication series

NameIEEE International Conference on Intelligent Robots and Systems

Conference

Conference2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007
Country/TerritoryUnited States
CitySan Diego, CA
Period29/10/072/11/07

Fingerprint

Dive into the research topics of 'Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression'. Together they form a unique fingerprint.

Cite this