TY - GEN
T1 - Optimizing distributed top-k queries
AU - Neumann, Thomas
AU - Bender, Matthias
AU - Michel, Sebastian
AU - Schenkel, Ralf
AU - Triantafillou, Peter
AU - Weikum, Gerhard
PY - 2008
Y1 - 2008
N2 - Top-k query processing is a fundamental building block for efficient ranking in a large number of applications. Efficiency is a central issue, especially for distributed settings, when the data is spread across different nodes in a network. This paper introduces novel optimization methods for top-k aggregation queries in such distributed environments that can be applied to all algorithms that fall into the frameworks of the prior TPUT and KLEE methods. The optimizations address 1) hierarchically grouping input lists into top-k operator trees and optimizing the tree structure, and 2) computing data-adaptive scan depths for different input sources. The paper presents comprehensive experiments with two different real-life datasets, using the ns-2 network simulator for a packet-level simulation of a large Internet-style network.
AB - Top-k query processing is a fundamental building block for efficient ranking in a large number of applications. Efficiency is a central issue, especially for distributed settings, when the data is spread across different nodes in a network. This paper introduces novel optimization methods for top-k aggregation queries in such distributed environments that can be applied to all algorithms that fall into the frameworks of the prior TPUT and KLEE methods. The optimizations address 1) hierarchically grouping input lists into top-k operator trees and optimizing the tree structure, and 2) computing data-adaptive scan depths for different input sources. The paper presents comprehensive experiments with two different real-life datasets, using the ns-2 network simulator for a packet-level simulation of a large Internet-style network.
UR - http://www.scopus.com/inward/record.url?scp=52149095335&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-85481-4_26
DO - 10.1007/978-3-540-85481-4_26
M3 - Conference contribution
AN - SCOPUS:52149095335
SN - 3540854800
SN - 9783540854807
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 337
EP - 349
BT - Web Information Systems Engineering - WISE 2008 - 9th International Conference, Proceedings
T2 - 9th International Conference on Web Information Systems Engineering, WISE 2008
Y2 - 1 September 2008 through 3 September 2008
ER -