Abstract
This paper compares various contemporary multicore-based microprocessor architectures from different vendors with different memory interconnects regarding performance, speedup, and parallel efficiency. Sparse matrix decomposition is used as a benchmark application. The example matrix used in the experiments comes from an electrical engineering application, where numerical simulation of physical processes plays an important role in the design of industrial products. Within this context, thread-to-core pinning and cache optimization are two important aspects which are investigated in more detail.
Original language | English |
---|---|
Pages (from-to) | 132-140 |
Number of pages | 9 |
Journal | Journal of Supercomputing |
Volume | 57 |
Issue number | 2 |
DOIs | |
State | Published - Aug 2011 |
Keywords
- Cache optimization
- Multicore
- Performance optimization
- Pinning
- Sparse matrices