A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control

Baha Zarrouki, Marios Spanakakis, Johannes Betz

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

Abstract

Determining the optimal cost function parameters of Model Predictive Control (MPC) to optimize multiple control objectives is a challenging and time-consuming task. Multi-objective Bayesian Optimization (BO) techniques solve this problem by determining a Pareto optimal parameter set for an MPC with static weights. However, a single parameter set may not deliver the most optimal closed-loop control performance when the context of the MPC operating conditions changes during its operation, urging the need to adapt the cost function weights at runtime. Deep Reinforcement Learning (RL) algorithms can automatically learn context-dependent optimal parameter sets and dynamically adapt for a Weights-varying MPC (WMPC). However, learning cost function weights from scratch in a continuous action space may lead to unsafe operating states. To solve this, we propose a novel approach limiting the RL action space within a safe learning space that we represent by a catalog of pre-optimized feasible BO Pareto-optimal weight sets. We conceive an RL agent not to learn in a continuous space but to select the most optimal discrete actions, each corresponding to a single set of Pareto optimal weights, by proactively anticipating upcoming control tasks in a context-dependent manner. This approach introduces a two-step optimization: (1) safety-critical with BO and (2) performance-driven with RL. Hence, even an untrained RL agent guarantees a safe and optimal performance. Simulation results demonstrate that an untrained RL-WMPC shows Pareto-optimal closed-loop behavior and training the RL-WMPC helps exhibit a performance beyond the Pareto-front. The code used in this research is publicly accessible as open-source software: https://github.com/bzarr/TUM-CONTROL

OriginalspracheEnglisch
Titel35th IEEE Intelligent Vehicles Symposium, IV 2024
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers Inc.
Seiten1401-1408
Seitenumfang8
ISBN (elektronisch)9798350348811
DOIs
PublikationsstatusVeröffentlicht - 2024
Veranstaltung35th IEEE Intelligent Vehicles Symposium, IV 2024 - Jeju Island, Südkorea
Dauer: 2 Juni 20245 Juni 2024

Publikationsreihe

NameIEEE Intelligent Vehicles Symposium, Proceedings
ISSN (Print)1931-0587
ISSN (elektronisch)2642-7214

Konferenz

Konferenz35th IEEE Intelligent Vehicles Symposium, IV 2024
Land/GebietSüdkorea
OrtJeju Island
Zeitraum2/06/245/06/24

Fingerprint

Untersuchen Sie die Forschungsthemen von „A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren