Overcoming scalability challenges for tool daemon launching

Dong H. Ahn, Dorian C. Arnold, Bronis R. De Supinski, Gregory L. Lee, Barton P. Miller, Martin Schulz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

18 Scopus citations

Abstract

Many tools that target parallel and distributed environments must co-locate a set of daemons with the distributed processes of the target application. However, efficient and portable deployment of these daemons on large scale systems is an unsolved problem. We overcome this gap with LaunchMON, a scalable, robust, portable, secure, and general purpose infrastructure for launching tool daemons. Its API allows tool builders to identify all processes of a target job, launch daemons on the relevant nodes and control daemon interaction. Our results show that LaunchMON scales to very large daemon counts and substantially enhances performance over existing ad hoc mechanisms.

Original languageEnglish
Title of host publicationProceedings - 37th International Conference on Parallel Processing, ICPP 2008
Pages578-585
Number of pages8
DOIs
StatePublished - 2008
Externally publishedYes
Event37th International Conference on Parallel Processing, ICPP 2008 - Portland, OR, United States
Duration: 9 Sep 200812 Sep 2008

Publication series

NameProceedings of the International Conference on Parallel Processing
ISSN (Print)0190-3918

Conference

Conference37th International Conference on Parallel Processing, ICPP 2008
Country/TerritoryUnited States
CityPortland, OR
Period9/09/0812/09/08

Fingerprint

Dive into the research topics of 'Overcoming scalability challenges for tool daemon launching'. Together they form a unique fingerprint.

Cite this