Community training: Partitioning schemes in good shape for federated data grids

Tobias Scholl, Richard Kuntschke, Angelika Reiser, Alfons Kemper

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

8 Zitate (Scopus)

Abstract

In federated Data Grids, individual institutions share their data sets within a community to enable collaborative data analysis. Data access needs to be provided in a scalable fashion since in most e-science communities, data sets do not only grow exponentially but also experience an increasing popularity. If data autonomy is retained, each individual institution has to ensure efficient access to its data. Analyzing application-specific data properties (such as data skew) or query characteristics (query patterns) and distributing data within Data Grids accordingly, allows for improved throughput for data-intensive applications and enables better load-balancing between shared resources. We propose a framework for investigating application-specific index structures for creating suitable partitioning schemes. We evaluate two variants of the well-known Quadtree data structure as well as the Zones approach, an index structure from the astrophysics domain, according to several criteria. Our framework improves data access within federated Data Grids and can be combined with well-established Grid methods as well as with more flexible P2P technologies.

OriginalspracheEnglisch
TitelProceedings - e-Science 2007, 3rd IEEE International Conference on e-Science and Grid Computing
Seiten195-202
Seitenumfang8
DOIs
PublikationsstatusVeröffentlicht - 2007
VeranstaltungE-Science 2007, 3rd IEEE International Conference on E-Science and Grid Computing - Bangalore, Indien
Dauer: 10 Dez. 200713 Dez. 2007

Publikationsreihe

NameProceedings - e-Science 2007, 3rd IEEE International Conference on e-Science and Grid Computing

Konferenz

KonferenzE-Science 2007, 3rd IEEE International Conference on E-Science and Grid Computing
Land/GebietIndien
OrtBangalore
Zeitraum10/12/0713/12/07

Fingerprint

Untersuchen Sie die Forschungsthemen von „Community training: Partitioning schemes in good shape for federated data grids“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren