TY - JOUR
T1 - Reinforcement Learning with Ensemble Model Predictive Safety Certification
AU - Gronauer, Sven
AU - Haider, Tom
AU - da Roza, Felippe Schmoeller
AU - Diepold, Klaus
N1 - Publisher Copyright:
© 2024 International Foundation for Autonomous Agents and Multiagent Systems.
PY - 2024
Y1 - 2024
N2 - Reinforcement learning algorithms need exploration to learn. However, unsupervised exploration prevents the deployment of such algorithms on safety-critical tasks and limits real-world deployment. In this paper, we propose a new algorithm called Ensemble Model Predictive Safety Certification that combines model-based deep reinforcement learning with tube-based model predictive control to correct the actions taken by a learning agent, keeping safety constraint violations at a minimum through planning. Our approach aims to reduce the amount of prior knowledge about the actual system by requiring only offline data generated by a safe controller. Our results show that we can achieve significantly fewer constraint violations than comparable reinforcement learning methods.
AB - Reinforcement learning algorithms need exploration to learn. However, unsupervised exploration prevents the deployment of such algorithms on safety-critical tasks and limits real-world deployment. In this paper, we propose a new algorithm called Ensemble Model Predictive Safety Certification that combines model-based deep reinforcement learning with tube-based model predictive control to correct the actions taken by a learning agent, keeping safety constraint violations at a minimum through planning. Our approach aims to reduce the amount of prior knowledge about the actual system by requiring only offline data generated by a safe controller. Our results show that we can achieve significantly fewer constraint violations than comparable reinforcement learning methods.
KW - Model-based Learning
KW - Predictive Safety Filter
KW - Reinforcement Learning
KW - Safe Exploration
KW - Safe Reinforcement Learning
UR - http://www.scopus.com/inward/record.url?scp=85196363361&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85196363361
SN - 1548-8403
VL - 2024-May
SP - 724
EP - 732
JO - Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
JF - Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
T2 - 23rd International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2024
Y2 - 6 May 2024 through 10 May 2024
ER -