TY - JOUR
T1 - Actor-critic reinforcement learning for the feedback control of a swinging chain⁎
AU - Dengler, C.
AU - Lohmann, B.
N1 - Publisher Copyright:
© 2018
PY - 2018/1/1
Y1 - 2018/1/1
N2 - Reinforcement learning offers a multitude of algorithms allowing to learn a nonlinear controller by interacting with the system without the need for a model of the plant. In this paper we investigate the suitability of online learning algorithms for a control task with incomplete state information. The system under consideration is a swinging chain that needs to be stabilized at a desired position, a problem that is occurring e.g. with bridge cranes with each change in the crane position. The measurable states are the position, velocity, angle and angular velocity at the top of the chain. A solution of the control problem based on an approximation of the chain as a continuous cable exists in the literature, see d’ Andrea-Novel and Coron (2000), which is included in the comparison as a reference for the control performance of the learned controllers.
AB - Reinforcement learning offers a multitude of algorithms allowing to learn a nonlinear controller by interacting with the system without the need for a model of the plant. In this paper we investigate the suitability of online learning algorithms for a control task with incomplete state information. The system under consideration is a swinging chain that needs to be stabilized at a desired position, a problem that is occurring e.g. with bridge cranes with each change in the crane position. The measurable states are the position, velocity, angle and angular velocity at the top of the chain. A solution of the control problem based on an approximation of the chain as a continuous cable exists in the literature, see d’ Andrea-Novel and Coron (2000), which is included in the comparison as a reference for the control performance of the learned controllers.
KW - Dynamic modeling
KW - Function approximation
KW - Learning algorithms
KW - Nonlinear control
UR - http://www.scopus.com/inward/record.url?scp=85052655483&partnerID=8YFLogxK
U2 - 10.1016/j.ifacol.2018.07.308
DO - 10.1016/j.ifacol.2018.07.308
M3 - Article
AN - SCOPUS:85052655483
SN - 2405-8963
VL - 51
SP - 378
EP - 383
JO - 2nd IFAC Conference on Modelling, Identification and Control of Nonlinear Systems MICNON 2018: Guadalajara, Jalisco, Mexico, 20-22 June 2018
JF - 2nd IFAC Conference on Modelling, Identification and Control of Nonlinear Systems MICNON 2018: Guadalajara, Jalisco, Mexico, 20-22 June 2018
IS - 13
ER -