Machine learning based online performance prediction for runtime parallelization and task scheduling

Jiangtian Li, Xiaosong Ma, Karan Singh, Martin Schulz, Bronis De Supinski, Sally A. McKee

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

43 Scopus citations

Abstract

With the emerging many-core paradigm, parallel programming must extend beyond its traditional realm of scientific applications. Converting existing sequential applications as well as developing next-generation software requires assistance from hardware, compilers and runtime systems to exploit parallelism transparently within applications. These systems must decompose applications into tasks that can be executed in parallel and then schedule those tasks to minimize load imbalance. However, many systems lack a priori knowledge about the execution time of all tasks to perform effective load balancing with low scheduling overhead. In this paper, we approach this fundamental problem using machine learning techniques first to generate performance models for all tasks and then applying those models to perform automatic performance prediction across program executions. We also extend an existing scheduling algorithm to use generated task cost estimates for online task partitioning and scheduling. We implement the above techniques in the pR framework, which transparently parallelizes scripts in the popular R language, and evaluate their performance and overhead with both a realworld application and a large number of synthetic representative test scripts. Our experimental results show that our proposed approach significantly improves task partitioning and scheduling, with maximum improvements of 21.8%, 40.3% and 22.1% and average improvements of 15.9%, 16.9% and 4.2% for LMM (a real R application) and synthetic test cases with independent and dependent tasks, respectively.

Original languageEnglish
Title of host publicationISPASS 2009 - International Symposium on Performance Analysis of Systems and Software
Pages89-100
Number of pages12
DOIs
StatePublished - 2009
Externally publishedYes
EventInternational Symposium on Performance Analysis of Systems and Software, ISPASS 2009 - Boston, MA, United States
Duration: 26 Apr 200928 Apr 2009

Publication series

NameISPASS 2009 - International Symposium on Performance Analysis of Systems and Software

Conference

ConferenceInternational Symposium on Performance Analysis of Systems and Software, ISPASS 2009
Country/TerritoryUnited States
CityBoston, MA
Period26/04/0928/04/09

Keywords

  • Artificial Neural Networks
  • Automatic Task Scheduling
  • Performance Prediction
  • Scripting Languages

Fingerprint

Dive into the research topics of 'Machine learning based online performance prediction for runtime parallelization and task scheduling'. Together they form a unique fingerprint.

Cite this