TY - GEN
T1 - Machine learning based online performance prediction for runtime parallelization and task scheduling
AU - Li, Jiangtian
AU - Ma, Xiaosong
AU - Singh, Karan
AU - Schulz, Martin
AU - De Supinski, Bronis
AU - McKee, Sally A.
PY - 2009
Y1 - 2009
N2 - With the emerging many-core paradigm, parallel programming must extend beyond its traditional realm of scientific applications. Converting existing sequential applications as well as developing next-generation software requires assistance from hardware, compilers and runtime systems to exploit parallelism transparently within applications. These systems must decompose applications into tasks that can be executed in parallel and then schedule those tasks to minimize load imbalance. However, many systems lack a priori knowledge about the execution time of all tasks to perform effective load balancing with low scheduling overhead. In this paper, we approach this fundamental problem using machine learning techniques first to generate performance models for all tasks and then applying those models to perform automatic performance prediction across program executions. We also extend an existing scheduling algorithm to use generated task cost estimates for online task partitioning and scheduling. We implement the above techniques in the pR framework, which transparently parallelizes scripts in the popular R language, and evaluate their performance and overhead with both a realworld application and a large number of synthetic representative test scripts. Our experimental results show that our proposed approach significantly improves task partitioning and scheduling, with maximum improvements of 21.8%, 40.3% and 22.1% and average improvements of 15.9%, 16.9% and 4.2% for LMM (a real R application) and synthetic test cases with independent and dependent tasks, respectively.
AB - With the emerging many-core paradigm, parallel programming must extend beyond its traditional realm of scientific applications. Converting existing sequential applications as well as developing next-generation software requires assistance from hardware, compilers and runtime systems to exploit parallelism transparently within applications. These systems must decompose applications into tasks that can be executed in parallel and then schedule those tasks to minimize load imbalance. However, many systems lack a priori knowledge about the execution time of all tasks to perform effective load balancing with low scheduling overhead. In this paper, we approach this fundamental problem using machine learning techniques first to generate performance models for all tasks and then applying those models to perform automatic performance prediction across program executions. We also extend an existing scheduling algorithm to use generated task cost estimates for online task partitioning and scheduling. We implement the above techniques in the pR framework, which transparently parallelizes scripts in the popular R language, and evaluate their performance and overhead with both a realworld application and a large number of synthetic representative test scripts. Our experimental results show that our proposed approach significantly improves task partitioning and scheduling, with maximum improvements of 21.8%, 40.3% and 22.1% and average improvements of 15.9%, 16.9% and 4.2% for LMM (a real R application) and synthetic test cases with independent and dependent tasks, respectively.
KW - Artificial Neural Networks
KW - Automatic Task Scheduling
KW - Performance Prediction
KW - Scripting Languages
UR - http://www.scopus.com/inward/record.url?scp=70349185990&partnerID=8YFLogxK
U2 - 10.1109/ISPASS.2009.4919641
DO - 10.1109/ISPASS.2009.4919641
M3 - Conference contribution
AN - SCOPUS:70349185990
SN - 9781424441846
T3 - ISPASS 2009 - International Symposium on Performance Analysis of Systems and Software
SP - 89
EP - 100
BT - ISPASS 2009 - International Symposium on Performance Analysis of Systems and Software
T2 - International Symposium on Performance Analysis of Systems and Software, ISPASS 2009
Y2 - 26 April 2009 through 28 April 2009
ER -