TY - GEN
T1 - Comparing scalability prediction strategies on an SMP of CMPs
AU - Singh, Karan
AU - Curtis-Maury, Matthew
AU - McKee, Sally A.
AU - Blagojević, Filip
AU - Nikolopoulos, Dimitrios S.
AU - De Supinski, Bronis R.
AU - Schulz, Martin
PY - 2010
Y1 - 2010
N2 - Diminishing performance returns and increasing power consumption of single-threaded processors have made chip multiprocessors (CMPs) an industry imperative. Unfortunately, poor software/hardware interaction and bottlenecks in shared hardware structures can prevent scaling to many cores. In fact, adding a core may harm performance and increase power consumption. Given these observations, we compare two approaches to predicting parallel application scalability: multiple linear regression and artificial neural networks (ANNs). We throttle concurrency to levels with higher predicted power/performance efficiency. We perform experiments on a state-of-the-art, dual-processor, quad-core platform, showing that both methodologies achieve high accuracy and identify energy-efficient concurrency levels in multithreaded scientific applications. The ANN approach has advantages, but the simpler regression-based model achieves slightly higher accuracy and performance. The approaches exhibit median error of 7.5% and 5.6%, and improve performance by an average of 7.4% and 9.5%, respectively.
AB - Diminishing performance returns and increasing power consumption of single-threaded processors have made chip multiprocessors (CMPs) an industry imperative. Unfortunately, poor software/hardware interaction and bottlenecks in shared hardware structures can prevent scaling to many cores. In fact, adding a core may harm performance and increase power consumption. Given these observations, we compare two approaches to predicting parallel application scalability: multiple linear regression and artificial neural networks (ANNs). We throttle concurrency to levels with higher predicted power/performance efficiency. We perform experiments on a state-of-the-art, dual-processor, quad-core platform, showing that both methodologies achieve high accuracy and identify energy-efficient concurrency levels in multithreaded scientific applications. The ANN approach has advantages, but the simpler regression-based model achieves slightly higher accuracy and performance. The approaches exhibit median error of 7.5% and 5.6%, and improve performance by an average of 7.4% and 9.5%, respectively.
UR - http://www.scopus.com/inward/record.url?scp=78349251674&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-15277-1_14
DO - 10.1007/978-3-642-15277-1_14
M3 - Conference contribution
AN - SCOPUS:78349251674
SN - 3642152767
SN - 9783642152764
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 143
EP - 155
BT - Euro-Par 2010 Parallel Processing - 16th International Euro-Par Conference, Proceedings
T2 - 16th International Euro-Par Conference on Parallel Processing, Euro-Par 2010
Y2 - 31 August 2010 through 3 September 2010
ER -