Continuous Control of Autonomous Vehicles using Plan-assisted Deep Reinforcement Learning

Tanay Dwivedi, Tobias Betz, Florian Sauerbeck, Pv Manivannan, Markus Lienkamp

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

1 Zitat (Scopus)

Abstract

End-to-end deep reinforcement learning (DRL) is emerging as a promising paradigm for autonomous driving. Although DRL provides an elegant framework to accomplish final goals without extensive manual engineering, capturing plans and behavior using deep neural networks is still an unsolved issue. End-to-end architectures, as a result, are currently limited to simple driving scenarios, often performing sub-optimally when rare, unique conditions are encountered. We propose a novel plan-assisted deep reinforcement learning framework that, along with the typical state-space, leverages a 'trajectory-space' to learn optimal control. While the trajectory-space, generated by an external planner, intrinsically captures the agent's high-level plans, world models are used to understand the dynamics of the environment for learning behavior in latent space. An actor-critic network, trained in imagination, uses these latent features to predict policy and state-value function. Based primarily on DreamerV2 and Racing Dreamer, the proposed model is first trained in a simulator and eventually tested on the FITENTH race car. We evaluate our model for best lap times against parameter-tuned and learning-based controllers on unseen race tracks and demonstrate that it generalizes to complex scenarios where other approaches perform sub-optimally. Furthermore, we show the model's enhanced stability as a trajectory tracker and establish the improvement in interpretability achieved by the proposed framework.

OriginalspracheEnglisch
Titel2022 22nd International Conference on Control, Automation and Systems, ICCAS 2022
Herausgeber (Verlag)IEEE Computer Society
Seiten244-250
Seitenumfang7
ISBN (elektronisch)9788993215243
DOIs
PublikationsstatusVeröffentlicht - 2022
Veranstaltung22nd International Conference on Control, Automation and Systems, ICCAS 2022 - Busan, Südkorea
Dauer: 27 Nov. 20221 Dez. 2022

Publikationsreihe

NameInternational Conference on Control, Automation and Systems
Band2022-November
ISSN (Print)1598-7833

Konferenz

Konferenz22nd International Conference on Control, Automation and Systems, ICCAS 2022
Land/GebietSüdkorea
OrtBusan
Zeitraum27/11/221/12/22

Fingerprint

Untersuchen Sie die Forschungsthemen von „Continuous Control of Autonomous Vehicles using Plan-assisted Deep Reinforcement Learning“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren