SQL- and operator-centric data analytics in relational main-Memory databases

Linnea Passing, Manuel Then, Nina Hubig, Harald Lang, Michael Schreier, Stephan Günnemann, Alfons Kemper, Thomas Neumann

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

33 Zitate (Scopus)

Abstract

Data volume and complexity continue to increase, as does the need for insight into data. Today, data management and data analytics are most often conducted in separate systems: database systems and dedicated analytics systems. This separation leads to time- and resource-consuming data transfer, stale data, and complex IT architectures. In this paper we show that relational main-memory database systems are capable of executing analytical algorithms in a fully transactional environment while still exceeding performance of state-of-the-art analytical systems rendering the division of data management and data analytics unnecessary. We classify and assess multiple ways of integrating data analytics in database systems. Based on this assessment, we extend SQL with a non-appending iteration construct that provides an important building block for analytical algorithms while retaining the high expressiveness of the original language. Furthermore, we propose the integration of analytics operators directly into the database core, where algorithms can be highly tuned for modern hardware. These operators can be parameterized with our novel user-defined lambda expressions. As we integrate lambda expressions into SQL instead of proposing a new proprietary query language, we ensure usability for diverse groups of users. Additionally, we carry out an extensive experimental evaluation of our approaches in HyPer, our full-fledged SQL main-memory database system, and show their superior performance in comparison to dedicated solutions.

OriginalspracheEnglisch
TitelAdvances in Database Technology - EDBT 2017
Untertitel20th International Conference on Extending Database Technology, Proceedings
Redakteure/-innenBernhard Mitschang, Volker Markl, Sebastian Bress, Periklis Andritsos, Kai-Uwe Sattler, Salvatore Orlando
Herausgeber (Verlag)OpenProceedings.org
Seiten84-95
Seitenumfang12
ISBN (elektronisch)9783893180738
DOIs
PublikationsstatusVeröffentlicht - 2017
Veranstaltung20th International Conference on Extending Database Technology, EDBT 2017 - Venice, Italien
Dauer: 21 März 201724 März 2017

Publikationsreihe

NameAdvances in Database Technology - EDBT
Band2017-March
ISSN (elektronisch)2367-2005

Konferenz

Konferenz20th International Conference on Extending Database Technology, EDBT 2017
Land/GebietItalien
OrtVenice
Zeitraum21/03/1724/03/17

Fingerprint

Untersuchen Sie die Forschungsthemen von „SQL- and operator-centric data analytics in relational main-Memory databases“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren