Learning a Low-Dimensional Representation of a Safe Region for Safe Reinforcement Learning on Dynamical Systems

Zhehua Zhou, Ozgur S. Oguz, Marion Leibold, Martin Buss

Publikation: Beitrag in FachzeitschriftArtikelBegutachtung

3 Zitate (Scopus)

Abstract

For the safe application of reinforcement learning algorithms to high-dimensional nonlinear dynamical systems, a simplified system model is used to formulate a safe reinforcement learning (SRL) framework. Based on the simplified system model, a low-dimensional representation of the safe region is identified and used to provide safety estimates for learning algorithms. However, finding a satisfying simplified system model for complex dynamical systems usually requires a considerable amount of effort. To overcome this limitation, we propose a general data-driven approach that is able to efficiently learn a low-dimensional representation of the safe region. By employing an online adaptation method, the low-dimensional representation is updated using the feedback data to obtain more accurate safety estimates. The performance of the proposed approach for identifying the low-dimensional representation of the safe region is illustrated using the example of a quadcopter. The results demonstrate a more reliable and representative low-dimensional representation of the safe region compared with previous works, which extends the applicability of the SRL framework.

OriginalspracheEnglisch
Seiten (von - bis)2513-2527
Seitenumfang15
FachzeitschriftIEEE Transactions on Neural Networks and Learning Systems
Jahrgang34
Ausgabenummer5
DOIs
PublikationsstatusVeröffentlicht - 1 Mai 2023

Fingerprint

Untersuchen Sie die Forschungsthemen von „Learning a Low-Dimensional Representation of a Safe Region for Safe Reinforcement Learning on Dynamical Systems“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren