Ensemble Probabilistic Model Predictive Safety Certification for Learning-Based Control

Nehir Guzelkaya, Sven Gronauer, Klaus Diepold

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Despite the proven efficacy of learning-based control schemes, the absence of safety guarantees prevents their deployment in many applications. This paper addresses this problem by introducing a novel safety certification framework that leverages the principles of stochastic model predictive control. Our proposed framework modifies the actions taken by a learning-based controller in a minimally invasive fashion to enable non-conservative decision-making with adjustable safety levels for systems subject to additive unbounded noise. In addition to that, an ensemble of probabilistic neural networks is used to model the unknown aspects of the system dynamics. In situations where the proposed safety certification fails to find solutions, a constraint satisfaction probability maximizing backup algorithm is proposed. Finally, the efficacy of the proposed safety certification framework in reducing the number of violations is shown through simulations of reinforcement learning within a range of diverse environments.

Original languageEnglish
Title of host publication2024 IEEE 19th Conference on Industrial Electronics and Applications, ICIEA 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350360868
DOIs
StatePublished - 2024
Event19th IEEE Conference on Industrial Electronics and Applications, ICIEA 2024 - Kristiansand, Norway
Duration: 5 Aug 20248 Aug 2024

Publication series

Name2024 IEEE 19th Conference on Industrial Electronics and Applications, ICIEA 2024

Conference

Conference19th IEEE Conference on Industrial Electronics and Applications, ICIEA 2024
Country/TerritoryNorway
CityKristiansand
Period5/08/248/08/24

Fingerprint

Dive into the research topics of 'Ensemble Probabilistic Model Predictive Safety Certification for Learning-Based Control'. Together they form a unique fingerprint.

Cite this