TY - GEN
T1 - The case for hybrid succinct data structures
AU - Anneser, Christoph
AU - Kipf, Andreas
AU - Lang, Harald
AU - Neumann, Thomas
AU - Kemper, Alfons
N1 - Publisher Copyright:
© 2020 Copyright held by the owner/author(s). Published in Proceedings of the 23rd International Conference on Extending Database Technology (EDBT), March 30-April 2, 2020, ISBN 978-3-89318-083-7 on OpenProceedings.org. Distribution of this paper is permitted under the terms of the Creative Commons license CC-by-nc-nd 4.0.
PY - 2020
Y1 - 2020
N2 - One of the biggest challenges in data management is to retain the high performance of in-memory processing with the ever increasing data volumes. Recent years have shown that the amount of collected data is increasing at a faster pace than DRAM capacities. Many state-of-the-art index data structures are optimized for performance rather than for low space consumption and quickly exceed the limited main-memory capacities when indexing larger data sets. Succinct data structures on the other hand allow for space efficient indexing, but compromise performance while still being orders of magnitude faster than disk-based data structures. In this work, we propose a novel framework that combines state-of-the-art indexes with succinct data structures to form new hybrid succinct data structures. These hybrids enable fine-grained trade-offs between space and performance. Frequently accessed parts of an index, e.g. the upper levels of a tree, are thereby maintained in performance-optimized structures whereas less frequently accessed parts have a space-optimized representation. Our evaluation shows that our approach can significantly reduce the amount of used space by up to 50% (resp. 90% for our compressed version) while retaining 93% (resp. 87%) of the performance.
AB - One of the biggest challenges in data management is to retain the high performance of in-memory processing with the ever increasing data volumes. Recent years have shown that the amount of collected data is increasing at a faster pace than DRAM capacities. Many state-of-the-art index data structures are optimized for performance rather than for low space consumption and quickly exceed the limited main-memory capacities when indexing larger data sets. Succinct data structures on the other hand allow for space efficient indexing, but compromise performance while still being orders of magnitude faster than disk-based data structures. In this work, we propose a novel framework that combines state-of-the-art indexes with succinct data structures to form new hybrid succinct data structures. These hybrids enable fine-grained trade-offs between space and performance. Frequently accessed parts of an index, e.g. the upper levels of a tree, are thereby maintained in performance-optimized structures whereas less frequently accessed parts have a space-optimized representation. Our evaluation shows that our approach can significantly reduce the amount of used space by up to 50% (resp. 90% for our compressed version) while retaining 93% (resp. 87%) of the performance.
UR - http://www.scopus.com/inward/record.url?scp=85084192458&partnerID=8YFLogxK
U2 - 10.5441/002/edbt.2020.40
DO - 10.5441/002/edbt.2020.40
M3 - Conference contribution
AN - SCOPUS:85084192458
T3 - Advances in Database Technology - EDBT
SP - 391
EP - 394
BT - Advances in Database Technology - EDBT 2020
A2 - Bonifati, Angela
A2 - Zhou, Yongluan
A2 - Vaz Salles, Marcos Antonio
A2 - Bohm, Alexander
A2 - Olteanu, Dan
A2 - Fletcher, George
A2 - Khan, Arijit
A2 - Yang, Bin
PB - OpenProceedings.org
T2 - 23rd International Conference on Extending Database Technology, EDBT 2020
Y2 - 30 March 2020 through 2 April 2020
ER -