TY - JOUR
T1 - Efficiently learning a distributed control policy in cyber-physical production systems via simulation optimization
AU - Zou, Minjie
AU - Huang, Edward
AU - Vogel-Heuser, Birgit
AU - Chen, Chun Hung
N1 - Publisher Copyright:
© 2021 Elsevier Ltd. All rights reserved.
PY - 2020
Y1 - 2020
N2 - The manufacturing industry is becoming more dynamic than ever. The limitations of non-deterministic network delays and real-time requirements call for decentralized control. For such dynamic and complex systems, learning methods stand out as a transformational technology to have a more flexible control solution. Using simulation for learning enables the description of highly dynamic systems and provides samples without occupying a real facility. However, it requires prohibitively expensive computation. In this paper, we argue that simulation optimization is a powerful tool that can be applied to various simulation-based learning processes for tremendous effects. We proposed an efficient policy learning framework, ROSA (Reinforcement-learning enhanced by Optimal Simulation Allocation), with unprecedented integration of learning, control, and simulation optimization techniques, which can drastically improve the efficiency of policy learning in a cyber-physical system. A proof-of-concept is implemented on a conveyer-switch network, demonstrating how ROSA can be applied for efficient policy learning, with an emphasis on the industrial distributed control system.
AB - The manufacturing industry is becoming more dynamic than ever. The limitations of non-deterministic network delays and real-time requirements call for decentralized control. For such dynamic and complex systems, learning methods stand out as a transformational technology to have a more flexible control solution. Using simulation for learning enables the description of highly dynamic systems and provides samples without occupying a real facility. However, it requires prohibitively expensive computation. In this paper, we argue that simulation optimization is a powerful tool that can be applied to various simulation-based learning processes for tremendous effects. We proposed an efficient policy learning framework, ROSA (Reinforcement-learning enhanced by Optimal Simulation Allocation), with unprecedented integration of learning, control, and simulation optimization techniques, which can drastically improve the efficiency of policy learning in a cyber-physical system. A proof-of-concept is implemented on a conveyer-switch network, demonstrating how ROSA can be applied for efficient policy learning, with an emphasis on the industrial distributed control system.
KW - Cyber-physical system
KW - Distributed control
KW - Multi-agent
KW - Reinforcement learning
KW - Simulation optimization
UR - http://www.scopus.com/inward/record.url?scp=85105262676&partnerID=8YFLogxK
U2 - 10.1109/CASE48305.2020.9249228
DO - 10.1109/CASE48305.2020.9249228
M3 - Conference article
AN - SCOPUS:85105262676
SN - 2161-8070
VL - 2020-January
SP - 645
EP - 651
JO - IEEE International Conference on Automation Science and Engineering
JF - IEEE International Conference on Automation Science and Engineering
M1 - 9249228
T2 - 16th IEEE International Conference on Automation Science and Engineering, CASE 2020
Y2 - 20 August 2020 through 21 August 2020
ER -