Algebraic Query Optimization for Distributed Top-k Queries

Thomas Neumann, Sebastian Michel

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Distributed top-k query processing is increasingly becoming an essential functionality in a large number of emerging application classes. This paper addresses the efficient algebraic optimization of top-k queries in wide-area distributed data repositories where the index lists for the attribute values (or text terms) of a query are distributed across a number of data peers and the computational costs include network latency, bandwidth consumption, and local peer work. We use a dynamic programming approach to find the optimal execution plan using compact data synopses for selectivity estimation that is the basis for our cost model. The optimized query is executed in a hierarchical way involving a small and fixed number of communication phases. We have performed experiments on real web data that show the benefits of distributed top-k query optimization both in network resource consumption and query response time.

Original languageEnglish
Title of host publicationDatenbanksysteme in Business, Technologie und Web, BTW 2007, 12. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2007, Proceedings
EditorsAlfons Kemper, Harald Schoning, Thomas Rose, Matthias Jarke, Thomas Seidl, Christoph Brochhaus
PublisherGesellschaft fur Informatik (GI)
Pages324-343
Number of pages20
ISBN (Electronic)9783885791973
StatePublished - 2007
Externally publishedYes
EventDatenbanksysteme in Business, Technologie und Web, BTW 2007, 12. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2007 - Database Systems for Business, Technology and Web, BTW 2007, 12th Conference of the GI Division "Databases and Information Systems", DBIS 2007 - Aachen, Germany
Duration: 7 Mar 20079 Mar 2007

Publication series

NameLecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft fur Informatik (GI)
VolumeP-103
ISSN (Print)1617-5468
ISSN (Electronic)2944-7682

Conference

ConferenceDatenbanksysteme in Business, Technologie und Web, BTW 2007, 12. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2007 - Database Systems for Business, Technology and Web, BTW 2007, 12th Conference of the GI Division "Databases and Information Systems", DBIS 2007
Country/TerritoryGermany
CityAachen
Period7/03/079/03/07

Fingerprint

Dive into the research topics of 'Algebraic Query Optimization for Distributed Top-k Queries'. Together they form a unique fingerprint.

Cite this