TY - JOUR
T1 - Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems
AU - Enders, Tobias
AU - Harrison, James
AU - Pavone, Marco
AU - Schiffer, Maximilian
N1 - Publisher Copyright: © 2023 T. Enders, J. Harrison, M. Pavone & M. Schiffer.
PY - 2023
Y1 - 2023
N2 - We consider the sequential decision-making problem of making proactive request assignment and rejection decisions for a profit-maximizing operator of an autonomous mobility on demand system. We formalize this problem as a Markov decision process and propose a novel combination of multi-agent Soft Actor-Critic and weighted bipartite matching to obtain an anticipative control policy. Thereby, we factorize the operator's otherwise intractable action space, but still obtain a globally coordinated decision. Experiments based on real-world taxi data show that our method outperforms state-of-the-art benchmarks with respect to performance, stability, and computational tractability.
AB - We consider the sequential decision-making problem of making proactive request assignment and rejection decisions for a profit-maximizing operator of an autonomous mobility on demand system. We formalize this problem as a Markov decision process and propose a novel combination of multi-agent Soft Actor-Critic and weighted bipartite matching to obtain an anticipative control policy. Thereby, we factorize the operator's otherwise intractable action space, but still obtain a globally coordinated decision. Experiments based on real-world taxi data show that our method outperforms state-of-the-art benchmarks with respect to performance, stability, and computational tractability.
KW - autonomous mobility on demand
KW - deep reinforcement learning
KW - hybrid learning and optimization
KW - multi-agent learning
UR - http://www.scopus.com/inward/record.url?scp=85172905256&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85172905256
SN - 2640-3498
VL - 211
SP - 1284
EP - 1296
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 5th Annual Conference on Learning for Dynamics and Control, L4DC 2023
Y2 - 15 June 2023 through 16 June 2023
ER -