Dynamic workload management for very large data warehouses - Juggling feathers and bowling balls

Stefan Krompass, Harumi Kuno, Umeshwar Dayal, Alfons Kemper

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

31 Scopus citations

Abstract

Workload management for business intelligence (BI) queries poses different challenges than those addressed in the online transaction processing (OLTP) context. The fundamental problem is that the execution times of BI queries can range from milliseconds to hours, and it is difficult to estimate these times accurately. Key challenges raised by this problem are how to identify queries that are not performing properly and what to do about them. We propose here a workload management system for controlling the execution of individual queries based on realistic customer service level objectives. In order to validate our proposal, we have implemented an experimental system that includes a dynamic execution controller that leverages fuzzy logic. We present results from a number of experiments that we ran using workloads based on actual industrial workloads and customer objectives that we gathered by interviewing industry practitioners. Our experiments show that even a handful of moderately mis-behaving problem queries can have a significant impact on a workload consisting of thousands of queries. We were surprised when our experiments also demonstrated that false positives - incorrectly identifying a normal query as a problem - can also have significant consequences. For those reasons, it is very important that an execution controller be as accurate as possible - avoiding both false positives and false negatives. Our experiments also validate that our execution controller can markedly improve the execution of a workload that includes problem queries.

Original languageEnglish
Title of host publication33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings
EditorsJohannes Gehrke, Christoph Koch, Minos Garofalakis, Karl Aberer, Carl-Christian Kanne, Erich J. Neuhold, Venkatesh Ganti, Wolfgang Klas, Chee-Yong Chan, Divesh Srivastava, Dana Florescu, Anand Deshpande
PublisherAssociation for Computing Machinery, Inc
Pages1105-1115
Number of pages11
ISBN (Electronic)9781595936493
StatePublished - 2007
Event33rd International Conference on Very Large Data Bases, VLDB 2007 - Vienna, Austria
Duration: 23 Sep 200727 Sep 2007

Publication series

Name33rd International Conference on Very Large Data Bases, VLDB 2007 - Conference Proceedings

Conference

Conference33rd International Conference on Very Large Data Bases, VLDB 2007
Country/TerritoryAustria
CityVienna
Period23/09/0727/09/07

Fingerprint

Dive into the research topics of 'Dynamic workload management for very large data warehouses - Juggling feathers and bowling balls'. Together they form a unique fingerprint.

Cite this