Causally Deterministic Markov Decision Processes

S. Akshay, Tobias Meggendorfer, P. S. Thiagarajan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Probabilistic systems are often modeled using factored versions of Markov decision processes (MDPs), where the states are composed out of the local states of components and each transition involves only a small subset of the components. Concurrency arises naturally in such systems. Our goal is to exploit concurrency when analyzing factored MDPs (FMDPs). To do so, we first formulate FMDPs in a way that aids this goal and port several notions from concurrency theory to the probabilistic setting of MDPs. In particular, we provide a concurrent semantics for FMDPs based on the classical notion of event structures, thereby cleanly separating causality, concurrency, and conflicts that arise from stochastic choices. We further identify the subclass of causally deterministic FMDPs (CMDPs), where non-determinism arises solely due to concurrency. Using our event structure semantics, we show that in CMDPs, local reachability properties can be computed using a “greedy” strategy. Finally, we implement our ideas in a prototype and apply it to four models, confirming the potential for substantial improvements over state-of-the-art methods.

Original languageEnglish
Title of host publication35th International Conference on Concurrency Theory, CONCUR 2024
EditorsRupak Majumdar, Alexandra Silva
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
ISBN (Electronic)9783959773393
DOIs
StatePublished - Sep 2024
Externally publishedYes
Event35th International Conference on Concurrency Theory, CONCUR 2024 - Calgary, Canada
Duration: 9 Sep 202413 Sep 2024

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume311
ISSN (Print)1868-8969

Conference

Conference35th International Conference on Concurrency Theory, CONCUR 2024
Country/TerritoryCanada
CityCalgary
Period9/09/2413/09/24

Keywords

  • MDPs
  • causal determinism
  • distribution

Fingerprint

Dive into the research topics of 'Causally Deterministic Markov Decision Processes'. Together they form a unique fingerprint.

Cite this