TY - JOUR
T1 - First steps towards real-world traffic signal control optimisation by reinforcement learning
AU - Meess, Henri
AU - Gerner, Jeremias
AU - Hein, Daniel
AU - Schmidtner, Stefanie
AU - Elger, Gordon
AU - Bogenberger, Klaus
N1 - Publisher Copyright:
© 2024 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
PY - 2024
Y1 - 2024
N2 - Enhancing traffic signal optimisation has the potential to improve urban traffic flow without the need for expensive infrastructure modifications. While reinforcement learning (RL) techniques have demonstrated their effectiveness in simulations, their real-world implementation is still a challenge. Real-world systems need to be developed that guarantee a deployable action definition for real traffic systems while prioritising safety constraints and robust policies. This paper introduces a method to overcome this challenge by introducing a novel action definition that optimises parameter-level control programmes designed by traffic engineers. The complete proposed framework consists of a traffic situation estimation, a feature extractor, and a system that enables training on estimates of real-world traffic situations. Further multimodal optimisation, scalability, and continuous training after deployment could be achieved. The first simulative tests using this action definition show an average improvement of more than 20% in traffic flow compared to the baseline–the corresponding pre-optimised real-world control.
AB - Enhancing traffic signal optimisation has the potential to improve urban traffic flow without the need for expensive infrastructure modifications. While reinforcement learning (RL) techniques have demonstrated their effectiveness in simulations, their real-world implementation is still a challenge. Real-world systems need to be developed that guarantee a deployable action definition for real traffic systems while prioritising safety constraints and robust policies. This paper introduces a method to overcome this challenge by introducing a novel action definition that optimises parameter-level control programmes designed by traffic engineers. The complete proposed framework consists of a traffic situation estimation, a feature extractor, and a system that enables training on estimates of real-world traffic situations. Further multimodal optimisation, scalability, and continuous training after deployment could be achieved. The first simulative tests using this action definition show an average improvement of more than 20% in traffic flow compared to the baseline–the corresponding pre-optimised real-world control.
KW - DRL
KW - MARL
KW - Multi-agent reinforcement learning in real-world
KW - multimodal traffic
KW - traffic optimisation
UR - http://www.scopus.com/inward/record.url?scp=85196809271&partnerID=8YFLogxK
U2 - 10.1080/17477778.2024.2364715
DO - 10.1080/17477778.2024.2364715
M3 - Article
AN - SCOPUS:85196809271
SN - 1747-7778
VL - 18
SP - 957
EP - 972
JO - Journal of Simulation
JF - Journal of Simulation
IS - 6
ER -