Algebraic query optimization for distributed top-k queries

Thomas Neumann, Sebastian Michel

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Distributed top-κ query processing is increasingly becoming an essential functionality in a large number of emerging application classes. This paper addresses the efficient algebraic optimization of top-κ queries in wide-area distributed data repositories where the index lists for the attribute values (or text terms) of a query are distributed across a number of data peers and the computational costs include network latency, bandwidth consumption, and local peer work.We use a dynamic programming approach to find the optimal execution plan using compact data synopses for selectivity estimation that is the basis for our cost model. The optimized query is executed in a hierarchical way involving a small and fixed number of communication phases. We have performed experiments on real web data that show the benefits of distributed top-κ query optimization both in network resource consumption and query response time.

Original languageEnglish
Title of host publicationDatenbanksysteme in Business, Technologie und Web, BTW 2007 - 12th Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme" (DBIS), Proceedings
Pages324-343
Number of pages20
StatePublished - 2007
Externally publishedYes
Event12th Symposium of the German Informatics Society Section "Databases and Information Systems" (DBIS) on Database Systems in Business, Technology and Web, BTW 2007 - Aachen, Germany
Duration: 7 Mar 20079 Mar 2007

Publication series

NameDatenbanksysteme in Business, Technologie und Web, BTW 2007 - 12th Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme" (DBIS), Proceedings

Conference

Conference12th Symposium of the German Informatics Society Section "Databases and Information Systems" (DBIS) on Database Systems in Business, Technology and Web, BTW 2007
Country/TerritoryGermany
CityAachen
Period7/03/079/03/07

Fingerprint

Dive into the research topics of 'Algebraic query optimization for distributed top-k queries'. Together they form a unique fingerprint.

Cite this