Abstract
Data access latencies and bandwidth bottlenecks frequently represent major limiting factors for the computational effectiveness of multi- and many-core processor architectures. This chapter deals with two conceptually complementary approaches to ensure that the data to be processed and the processing entities remain spatially confined, even under fluctuating workload and application compositions: region-based cache coherence (RBCC) and near-memory acceleration (NMA). In addition, so-called near-cache accelerators (NCA) are proposed and investigated. As the cache hierarchy is often bypassed when integrating in- or near-memory solutions, they need to be properly synchronized with the rest of the system, i.e., maintaining coherence and consistency between normal cores and NMAs. The efficiency and scalability of cache coherence plays an important role for tile-based MPSoCs. Both RBCC and NCA/NMA contribute to tackling the MPSoC data locality challenge for native as well as hybrid forms of distributed shared memory and shared memory models and architectures.
Original language | English |
---|---|
Title of host publication | Multi-Processor System-on-Chip 1 |
Subtitle of host publication | Architectures |
Publisher | wiley |
Pages | 87-117 |
Number of pages | 31 |
ISBN (Electronic) | 9781119818298 |
ISBN (Print) | 9781789450217 |
DOIs | |
State | Published - 26 Mar 2021 |
Keywords
- Distributed shared memory
- MPSoC data locality challenge
- Near-memory acceleration
- Region-based cache coherence
- Shared memory models