TY - GEN
T1 - Modular Production Control with Multi-Agent Deep Q-Learning
AU - Gankin, Dennis
AU - Mayer, Sebastian
AU - Zinn, Jonas
AU - Vogel-Heuser, Birgit
AU - Endisch, Christian
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - The automotive industry is increasingly focusing on product customization. The concept of Modular Production addresses this issue by providing more flexibility in production with Automated Guided Vehicles transporting products between modular workstations. The added complexity of Modular Production Control calls for approaches that can handle the scheduling complexity while also minimizing production costs. As a result, literature has focused on two promising approaches: Deep Reinforcement Learning and Multi-Agent Systems. Both approaches have their advantages. Especially in complex, large-scale production environments with random breakdowns, those two fields have been seldomly combined, though. As a result, this article aims to fill that research gap by introducing a Deep Reinforcement Learning Multi-Agent System approach for Modular Production Control. We introduce a reward design incentivizing agents to achieve maximal throughput. In addition, we show that the method learns optimal behavior even in a large-scale production environment with random machine breakdowns. Lastly, we compare the Multi-Agent System to a single-agent implementation of the Deep Reinforcement Learning approach and conclude that the Multi-Agent Deep Reinforcement Learning method learns and solves the Modular Production Control problem with the same solution quality as the single agent. Hence, the approach allows to foster MAS benefits such as robustness without losses in the solution quality.
AB - The automotive industry is increasingly focusing on product customization. The concept of Modular Production addresses this issue by providing more flexibility in production with Automated Guided Vehicles transporting products between modular workstations. The added complexity of Modular Production Control calls for approaches that can handle the scheduling complexity while also minimizing production costs. As a result, literature has focused on two promising approaches: Deep Reinforcement Learning and Multi-Agent Systems. Both approaches have their advantages. Especially in complex, large-scale production environments with random breakdowns, those two fields have been seldomly combined, though. As a result, this article aims to fill that research gap by introducing a Deep Reinforcement Learning Multi-Agent System approach for Modular Production Control. We introduce a reward design incentivizing agents to achieve maximal throughput. In addition, we show that the method learns optimal behavior even in a large-scale production environment with random machine breakdowns. Lastly, we compare the Multi-Agent System to a single-agent implementation of the Deep Reinforcement Learning approach and conclude that the Multi-Agent Deep Reinforcement Learning method learns and solves the Modular Production Control problem with the same solution quality as the single agent. Hence, the approach allows to foster MAS benefits such as robustness without losses in the solution quality.
KW - Deep Q-Network
KW - Deep Reinforcement Learning
KW - Modular Production Control
KW - Multi-Agent System
KW - Production Control
KW - Reinforcement Learning
KW - Scheduling
UR - http://www.scopus.com/inward/record.url?scp=85122935434&partnerID=8YFLogxK
U2 - 10.1109/ETFA45728.2021.9613177
DO - 10.1109/ETFA45728.2021.9613177
M3 - Conference contribution
AN - SCOPUS:85122935434
T3 - IEEE International Conference on Emerging Technologies and Factory Automation, ETFA
BT - Proceedings - 2021 26th IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 26th IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2021
Y2 - 7 September 2021 through 10 September 2021
ER -