Massively parallel numa-aware hash joins

Harald Lang, Viktor Leis, Martina Cezara Albutiu, Thomas Neumann, Alfons Kemper

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

Driven by the two main hardware trends increasing main memory and massively parallel multi-core processing in the past few years, there has been much research effort in parallelizing well-known join algorithms. However, the non-uniform memory access (NUMA) of these architectures to main memory has only gained limited attention in the design of these algorithms. We study recent proposals of main memory hash join implementations and identify their major performance problems on NUMA architectures. We then develop a NUMA-aware hash join for massively parallel environments, and show how the specific implementation details affect the performance on a NUMA system. Our experimental evaluation shows that a carefully engineered hash join implementation outperforms previous high performance hash joins by a factor of more than two, resulting in an unprecedented throughput of 3/4 billion join argument quintuples per second.

Original languageEnglish
Title of host publicationIn Memory Data Management and Analysis - 1st and 2nd International Workshops, IMDM 2013, IMDM 2014, Revised Selected Papers
EditorsThomas Neumann, Andrew Pavlo, Justin Levandoski, Arun Jagatheesan
PublisherSpringer Verlag
Pages3-14
Number of pages12
ISBN (Electronic)9783319139593
DOIs
StatePublished - 2015
Event1st International Workshop on In-Memory Data Management and Analytics, IMDM 2013 and 2nd International Workshop on In-Memory Data Management and Analytics, IMDM 2014 - Hongzhou, China
Duration: 1 Sep 20141 Sep 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8921
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference1st International Workshop on In-Memory Data Management and Analytics, IMDM 2013 and 2nd International Workshop on In-Memory Data Management and Analytics, IMDM 2014
Country/TerritoryChina
CityHongzhou
Period1/09/141/09/14

Fingerprint

Dive into the research topics of 'Massively parallel numa-aware hash joins'. Together they form a unique fingerprint.

Cite this