Declarative Sub-Operators for Universal Data Processing

Michael Jungmair, Jana Giceva

Publikation: Beitrag in FachzeitschriftKonferenzartikelBegutachtung

Abstract

Data processing systems face the challenge of supporting increasingly diverse workloads efficiently. At the same time, they are already bloated with internal complexity, and it is not clear how new hardware can be supported sustainably. In this paper, we aim to resolve these issues by proposing a unified abstraction layer based on declarative sub-operators in addition to relational operators. By exposing this layer to users, they can express their non-relational workloads declaratively with sub-operators. Furthermore, the proposed sub-operators decouple the semantic implementation of operators from the efficient imperative implementation, reducing the implementation complexity for relational operators. Finally, through fine-grained automatic optimizations, the declarative sub-operators allow for automatic morsel-driven parallelism. We demonstrate the benefits not only by providing a specific set of sub-operators but also implementing them in a compiling query engine. With thorough evaluation and analysis, we show that we can support a richer set of workloads while retaining the development complexity low and being competitive in performance even with specialized systems.

OriginalspracheEnglisch
Seiten (von - bis)3461-3474
Seitenumfang14
FachzeitschriftProceedings of the VLDB Endowment
Jahrgang16
Ausgabenummer11
DOIs
PublikationsstatusVeröffentlicht - 2023
Veranstaltung49th International Conference on Very Large Data Bases, VLDB 2023 - Vancouver, Kanada
Dauer: 28 Aug. 20231 Sept. 2023

Fingerprint

Untersuchen Sie die Forschungsthemen von „Declarative Sub-Operators for Universal Data Processing“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren