Porting existing cache-oblivious linear algebra HPC modules to larrabee architecture

Alexander Heinecke, Carsten Trinitis, Josef Weidendorfer

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

3 Zitate (Scopus)

Abstract

Cache-obliviousness represents an important but relatively new concept for cache optimization. As cache-oblivious algorithms perform well on architectures with arbitrary cache configurations, the programming effort required for porting and optimizing for future architectures can be significantly reduced. In [8] and [9], fast parallel cache-oblivious linear algebra modules have been presented. The underlying matrix storing schemes are based on space filling curves. For matrix multiplication, all cache misses can be avoided, whereas for the LU decomposition algorithm the number of cache misses is minimized. It has been shown that the resulting codes work very well on several kinds of systems ranging from laptops to supercomputers. In this paper, we will show that the runtime characteristics of our existing cache-oblivious codes can be preserved on newer Intel processors. Special emphasis is put on the first many-core processor architecture with complete hardware-based cache coherency: The Larrabee Architecture. As the latter is expected to be available as a PCIe card connected to the host system, porting had to take into account transfer of data structures between different memory address spaces. Unfortunately, Larrabee was canceled as a graphics device for 2010, but Intel is expected to outline futher steps about Larrabee during 2010.

OriginalspracheEnglisch
TitelCF 2010 - Proceedings of the 2010 Computing Frontiers Conference
Seiten91-92
Seitenumfang2
DOIs
PublikationsstatusVeröffentlicht - 2010
Veranstaltung7th ACM International Conference on Computing Frontiers, CF'10 - Bertinoro, Italien
Dauer: 17 Mai 201019 Mai 2010

Publikationsreihe

NameCF 2010 - Proceedings of the 2010 Computing Frontiers Conference

Konferenz

Konferenz7th ACM International Conference on Computing Frontiers, CF'10
Land/GebietItalien
OrtBertinoro
Zeitraum17/05/1019/05/10

Fingerprint

Untersuchen Sie die Forschungsthemen von „Porting existing cache-oblivious linear algebra HPC modules to larrabee architecture“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren