SOS: Safe, optimal and small strategies for hybrid markov decision processes

Pranav Ashok, Jan Křetínský, Kim Guldstrand Larsen, Adrien Le Coënt, Jakob Haahr Taankvist, Maximilian Weininger

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

24 Zitate (Scopus)

Abstract

For hybrid Markov decision processes, Stratego can compute strategies that are safe for a given safety property and (in the limit) optimal for a given cost function. Unfortunately, these strategies cannot be exported easily since they are computed as a very long list. In this paper, we demonstrate methods to learn compact representations of the strategies in the form of decision trees. These decision trees are much smaller, more understandable, and can easily be exported as code that can be loaded into embedded systems. Despite the size compression and actual differences to the original strategy, we provide guarantees on both safety and optimality of the decision-tree strategy. On the top, we show how to obtain yet smaller representations, which are still guaranteed safe, but achieve a desired trade-off between size and optimality.

OriginalspracheEnglisch
TitelQuantitative Evaluation of Systems - 16th International Conference, QEST 2019, Proceedings
Redakteure/-innenDavid Parker, Verena Wolf
Herausgeber (Verlag)Springer Verlag
Seiten147-164
Seitenumfang18
ISBN (Print)9783030302801
DOIs
PublikationsstatusVeröffentlicht - 2019
Veranstaltung16th International Conference on Quantitative Evaluation of Systems, QEST 2019 - Glasgow, Großbritannien/Vereinigtes Königreich
Dauer: 10 Sept. 201912 Sept. 2019

Publikationsreihe

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band11785 LNCS
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

Konferenz

Konferenz16th International Conference on Quantitative Evaluation of Systems, QEST 2019
Land/GebietGroßbritannien/Vereinigtes Königreich
OrtGlasgow
Zeitraum10/09/1912/09/19

Fingerprint

Untersuchen Sie die Forschungsthemen von „SOS: Safe, optimal and small strategies for hybrid markov decision processes“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren