TY - GEN
T1 - Simulation of In-Memory Database Workload
T2 - 2021 ACM/SPEC International Conference on Performance Engineering, ICPE 2021
AU - Barnert, Maximilian
AU - Krcmar, Helmut
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/4/9
Y1 - 2021/4/9
N2 - In the last years, performance modeling approaches have been proposed to tackle new concepts for modern In-Memory Database Systems (IMDB). While these approaches model specific performance-relevant aspects, workload representation during performance modeling is considered only marginally. Furthermore, the manual integration of workload into modeling approaches comes along with high effort and requires deep domain-specific knowledge. This paper presents our experience in representing workload within performance models for IMDB. In particular, we use a Markov chain-based approach to extract and reflect probabilistic user behavior during performance modeling. An automatic model generation process is integrated to simplify and reduce the effort for transferring workload characteristics from traces to performance models. In an experimental series running analytical and transactional workloads on an IMDB, we compare this approach with two other methods which rely on less granular data to reflect database workload within performance models, namely reproducing the relative invocation frequency of queries and using the same query execution probability. The results reveal a trade-off between accuracy and speed when simulating database workload. Markov chains are the most accurate independent from workload characteristics, but the relative invocation frequency approach is appropriate for scenarios where simulation speed is important.
AB - In the last years, performance modeling approaches have been proposed to tackle new concepts for modern In-Memory Database Systems (IMDB). While these approaches model specific performance-relevant aspects, workload representation during performance modeling is considered only marginally. Furthermore, the manual integration of workload into modeling approaches comes along with high effort and requires deep domain-specific knowledge. This paper presents our experience in representing workload within performance models for IMDB. In particular, we use a Markov chain-based approach to extract and reflect probabilistic user behavior during performance modeling. An automatic model generation process is integrated to simplify and reduce the effort for transferring workload characteristics from traces to performance models. In an experimental series running analytical and transactional workloads on an IMDB, we compare this approach with two other methods which rely on less granular data to reflect database workload within performance models, namely reproducing the relative invocation frequency of queries and using the same query execution probability. The results reveal a trade-off between accuracy and speed when simulating database workload. Markov chains are the most accurate independent from workload characteristics, but the relative invocation frequency approach is appropriate for scenarios where simulation speed is important.
KW - Markov chains
KW - SAP HANA
KW - database workload
KW - in-memory database systems
KW - performance modeling
KW - workload representation
UR - http://www.scopus.com/inward/record.url?scp=85104545876&partnerID=8YFLogxK
U2 - 10.1145/3427921.3450237
DO - 10.1145/3427921.3450237
M3 - Conference contribution
AN - SCOPUS:85104545876
T3 - ICPE 2021 - Proceedings of the ACM/SPEC International Conference on Performance Engineering
SP - 73
EP - 80
BT - ICPE 2021 - Proceedings of the ACM/SPEC International Conference on Performance Engineering
PB - Association for Computing Machinery, Inc
Y2 - 19 April 2021 through 21 April 2021
ER -