Skip to main navigation Skip to search Skip to main content

BigBench specification V0.1 BigBench: An industry standard benchmark for big data analytics

  • Tilmann Rabl
  • , Ahmad Ghazal
  • , Minqing Hu
  • , Alain Crolotte
  • , Francois Raab
  • , Meikel Poess
  • , Hans Arno Jacobsen
  • University of Toronto
  • Teradata Corp.
  • InfoSizing, Inc.
  • Oracle America, Inc.

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

In this article, we present the specification of BigBench, an end-to-end big data benchmark proposal. BigBench models a retail product supplier. The benchmark proposal covers a data model and a set of big data specific queries. BigBench's synthetic data generator addresses the variety, velocity and volume aspects of big data workloads. The structured part of the BigBench data model is adopted from the TPC-DS benchmark. In addition, the structured schema is enriched with semi-structured and unstructured data components that are common in a retail product supplier environment. This specification contains the full query set as well as the data model.

Original languageEnglish
Title of host publicationSpecifying Big Data Benchmarks - First Workshop, WBDB 2012, and Second Workshop, WBDB 2012, Revised Selected Papers
PublisherSpringer Verlag
Pages164-201
Number of pages38
ISBN (Print)9783642539732
DOIs
StatePublished - 2014
Externally publishedYes
Event2nd Workshop on Specifying Big Data Benchmarks, WBDB 2012 - Pune, India
Duration: 17 Dec 201218 Dec 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8163 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference2nd Workshop on Specifying Big Data Benchmarks, WBDB 2012
Country/TerritoryIndia
CityPune
Period17/12/1218/12/12

Fingerprint

Dive into the research topics of 'BigBench specification V0.1 BigBench: An industry standard benchmark for big data analytics'. Together they form a unique fingerprint.

Cite this