TY - GEN
T1 - Causally Deterministic Markov Decision Processes
AU - Akshay, S.
AU - Meggendorfer, Tobias
AU - Thiagarajan, P. S.
N1 - Publisher Copyright:
© S. Akshay, Tobias Meggendorfer, and P. S. Thiagarajan.
PY - 2024/9
Y1 - 2024/9
N2 - Probabilistic systems are often modeled using factored versions of Markov decision processes (MDPs), where the states are composed out of the local states of components and each transition involves only a small subset of the components. Concurrency arises naturally in such systems. Our goal is to exploit concurrency when analyzing factored MDPs (FMDPs). To do so, we first formulate FMDPs in a way that aids this goal and port several notions from concurrency theory to the probabilistic setting of MDPs. In particular, we provide a concurrent semantics for FMDPs based on the classical notion of event structures, thereby cleanly separating causality, concurrency, and conflicts that arise from stochastic choices. We further identify the subclass of causally deterministic FMDPs (CMDPs), where non-determinism arises solely due to concurrency. Using our event structure semantics, we show that in CMDPs, local reachability properties can be computed using a “greedy” strategy. Finally, we implement our ideas in a prototype and apply it to four models, confirming the potential for substantial improvements over state-of-the-art methods.
AB - Probabilistic systems are often modeled using factored versions of Markov decision processes (MDPs), where the states are composed out of the local states of components and each transition involves only a small subset of the components. Concurrency arises naturally in such systems. Our goal is to exploit concurrency when analyzing factored MDPs (FMDPs). To do so, we first formulate FMDPs in a way that aids this goal and port several notions from concurrency theory to the probabilistic setting of MDPs. In particular, we provide a concurrent semantics for FMDPs based on the classical notion of event structures, thereby cleanly separating causality, concurrency, and conflicts that arise from stochastic choices. We further identify the subclass of causally deterministic FMDPs (CMDPs), where non-determinism arises solely due to concurrency. Using our event structure semantics, we show that in CMDPs, local reachability properties can be computed using a “greedy” strategy. Finally, we implement our ideas in a prototype and apply it to four models, confirming the potential for substantial improvements over state-of-the-art methods.
KW - MDPs
KW - causal determinism
KW - distribution
UR - http://www.scopus.com/inward/record.url?scp=85203497922&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.CONCUR.2024.6
DO - 10.4230/LIPIcs.CONCUR.2024.6
M3 - Conference contribution
AN - SCOPUS:85203497922
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 35th International Conference on Concurrency Theory, CONCUR 2024
A2 - Majumdar, Rupak
A2 - Silva, Alexandra
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 35th International Conference on Concurrency Theory, CONCUR 2024
Y2 - 9 September 2024 through 13 September 2024
ER -