TY - GEN
T1 - Autonomic Orchestration of in-Situ and in-Transit Data Analytics For Simulation Studies
AU - Du, Xiaorui
AU - Meng, Zhuoxiao
AU - Siguenza-Torres, Anibal
AU - Knoll, Alois
AU - Pimpini, Adriano
AU - Piccione, Andrea
AU - Bortoli, Stefano
AU - Pellegrini, Alessandro
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Modern parallel/distributed simulations can produce large amounts of data. The historical approach of performing analyses at the end of the simulation is unlikely to cope with modern, extremely large-scale analytics jobs. Indeed, the I/O subsystem can quickly become the global bottleneck. Similarly, processing on-the-fly the data produced by simulations can significantly impair the performance in terms of computational capacity and network load. We present a methodology and reference architecture for constructing an autonomic control system to determine at runtime the best placement for data processing (on simulation nodes or a set of external nodes). This allows for a good tradeoff between the load on the simulation's critical path and the data communication system. Our preliminary experimentation shows that autonomic orchestration is crucial to improve the global performance of a data analysis system, especially when the simulation node's rate of data production varies during simulation.
AB - Modern parallel/distributed simulations can produce large amounts of data. The historical approach of performing analyses at the end of the simulation is unlikely to cope with modern, extremely large-scale analytics jobs. Indeed, the I/O subsystem can quickly become the global bottleneck. Similarly, processing on-the-fly the data produced by simulations can significantly impair the performance in terms of computational capacity and network load. We present a methodology and reference architecture for constructing an autonomic control system to determine at runtime the best placement for data processing (on simulation nodes or a set of external nodes). This allows for a good tradeoff between the load on the simulation's critical path and the data communication system. Our preliminary experimentation shows that autonomic orchestration is crucial to improve the global performance of a data analysis system, especially when the simulation node's rate of data production varies during simulation.
UR - http://www.scopus.com/inward/record.url?scp=85185379395&partnerID=8YFLogxK
U2 - 10.1109/WSC60868.2023.10408191
DO - 10.1109/WSC60868.2023.10408191
M3 - Conference contribution
AN - SCOPUS:85185379395
T3 - Proceedings - Winter Simulation Conference
SP - 781
EP - 792
BT - 2023 Winter Simulation Conference, WSC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 Winter Simulation Conference, WSC 2023
Y2 - 10 December 2023 through 13 December 2023
ER -