Abstract
In the age of massive data, databases are getting less convenient for data exploration tasks due to the costly loading phase. Still, the highly optimized query engines of database systems are greatly beneficial for the performance of data analysis tasks. With our research, we want to bridge this gap and provide paramount analytical performance without the need of static data loading. Our approach enables the integration of Parquet files - one of the most used columnar file format in the data lake context - into the data processing pipeline of a database system in a convenient way. We allow end-users to benefit from the database system performance without a costly and time-consuming loading phase.
| Originalsprache | Englisch |
|---|---|
| Fachzeitschrift | CEUR Workshop Proceedings |
| Jahrgang | 3651 |
| Publikationsstatus | Veröffentlicht - 2024 |
| Veranstaltung | 2024 Workshops of the EDBT/ICDT Joint Conference, EDBT/ICDT-WS 2024 - Paestum, Italien Dauer: 25 März 2024 → 25 März 2024 |
Fingerprint
Untersuchen Sie die Forschungsthemen von „Bridging the Gap between Data Lakes and RDBMSs Efficient Query Processing with Parquet“. Zusammen bilden sie einen einzigartigen Fingerprint.Dieses zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver