Umbra: A Disk-Based System with In-Memory Performance

Thomas Neumann, Michael Freitag

Research output: Contribution to conferencePaperpeer-review

120 Scopus citations

Abstract

The increases in main-memory sizes over the last decade have made pure in-memory database systems feasible, and in-memory systems offer unprecedented performance. However, DRAM is still relatively expensive, and the growth of main-memory sizes has slowed down. In contrast, the prices for SSDs have fallen substantially in the last years, and their read bandwidth has increased to gigabytes per second. This makes it attractive to combine a large in-memory buffer with fast SSDs as storage devices, combining the excellent performance for the in-memory working set with the scalability of a disk-based system. In this paper we present the Umbra system, an evolution of the pure in-memory HyPer system towards a disk-based, or rather SSD-based, system. We show that by introducing a novel low-overhead buffer manager with variable-size pages we can achieve comparable performance to an in-memory database system for the cached working set, while handling accesses to uncached data gracefully. We discuss the changes and techniques that were necessary to handle the out-of-memory case gracefully and with low overhead, offering insights into the design of a memory optimized disk-based system.

Original languageEnglish
StatePublished - 2020
Event10th Annual Conference on Innovative Data Systems Research, CIDR 2020 - Amsterdam, Netherlands
Duration: 12 Jan 202015 Jan 2020

Conference

Conference10th Annual Conference on Innovative Data Systems Research, CIDR 2020
Country/TerritoryNetherlands
CityAmsterdam
Period12/01/2015/01/20

Fingerprint

Dive into the research topics of 'Umbra: A Disk-Based System with In-Memory Performance'. Together they form a unique fingerprint.

Cite this