Actor-critic reinforcement learning for the feedback control of a swinging chain

C. Dengler, B. Lohmann

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Reinforcement learning offers a multitude of algorithms allowing to learn a nonlinear controller by interacting with the system without the need for a model of the plant. In this paper we investigate the suitability of online learning algorithms for a control task with incomplete state information. The system under consideration is a swinging chain that needs to be stabilized at a desired position, a problem that is occurring e.g. with bridge cranes with each change in the crane position. The measurable states are the position, velocity, angle and angular velocity at the top of the chain. A solution of the control problem based on an approximation of the chain as a continuous cable exists in the literature, see d’ Andrea-Novel and Coron (2000), which is included in the comparison as a reference for the control performance of the learned controllers.

Original languageEnglish
Pages (from-to)378-383
Number of pages6
Journal2nd IFAC Conference on Modelling, Identification and Control of Nonlinear Systems MICNON 2018: Guadalajara, Jalisco, Mexico, 20-22 June 2018
Volume51
Issue number13
DOIs
StatePublished - 1 Jan 2018

Keywords

  • Dynamic modeling
  • Function approximation
  • Learning algorithms
  • Nonlinear control

Fingerprint

Dive into the research topics of 'Actor-critic reinforcement learning for the feedback control of a swinging chain'. Together they form a unique fingerprint.

Cite this