Minimizing the Communication Cost of Aggregation in Publish/Subscribe Systems

Navneet Kumar Pandey, Kaiwen Zhang, Stephane Weiss, Hans Arno Jacobsen, Roman Vitenberg

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Scopus citations

Abstract

Modern applications for distributed publish/subscribe systems often require stream aggregation capabilities along with rich data filtering. When compared to other distributed systems, aggregation in pub/sub differentiates itself as a complex problem which involves dynamic dissemination paths that are difficult to predict and optimize for a priori, temporal fluctuations in publication rates, and the mixed presence of aggregated and non-aggregated workloads. In this paper, we propose a formalization for the problem of minimizing communication traffic in the context of aggregation in pub/sub. We present a solution to this minimization problem by using a reduction to the well-known problem of minimum vertex cover in a bipartite graph. This solution is optimal under the strong assumption of complete knowledge of future publications. We call the resulting algorithm 'Aggregation Decision, Optimal with Complete Knowledge' (ADOCK). We also show that under a dynamic setting without full knowledge, ADOCK can still be applied to produce a low, yet not necessarily optimal, communication cost. We also devise a computationally cheaper dynamic approach called 'Aggregation Decision with Weighted Publication' (WAD). We compare our solutions experimentally using two real datasets and explore the trade-offs with respect to communication and computation costs.

Original languageEnglish
Title of host publicationProceedings - 2015 IEEE 35th International Conference on Distributed Computing Systems, ICDCS 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages462-473
Number of pages12
ISBN (Electronic)9781467372145
DOIs
StatePublished - 22 Jul 2015
Externally publishedYes
Event35th IEEE International Conference on Distributed Computing Systems, ICDCS 2015 - Columbus, United States
Duration: 29 Jun 20152 Jul 2015

Publication series

NameProceedings - International Conference on Distributed Computing Systems
Volume2015-July

Conference

Conference35th IEEE International Conference on Distributed Computing Systems, ICDCS 2015
Country/TerritoryUnited States
CityColumbus
Period29/06/152/07/15

Keywords

  • Aggregation
  • Distributed event based system
  • Publish/Subscribe system

Fingerprint

Dive into the research topics of 'Minimizing the Communication Cost of Aggregation in Publish/Subscribe Systems'. Together they form a unique fingerprint.

Cite this