TY - GEN
T1 - A proposal for a new hardware cache monitoring architecture
AU - Schulz, Martin
AU - Tao, Jie
AU - Jeitner, Jürgen
AU - Karl, Wolfgang
N1 - Publisher Copyright:
Copyright 2002 ACM.
PY - 2002/6/16
Y1 - 2002/6/16
N2 - The analysis of the memory access behavior of applications, an essential step for a successful cache optimization, is a complex task. It needs to be supported with appropriate tools and monitoring facilities. Currently, however, users can only rely on either simulation based approaches, which deliver a large degree of detail but are restricted in their applicability, or on hardware counters embedded into processors, which allow to keep track of very few, mostly global events and hence only provide limited data. In this work a proposal for novel hardware monitoring facility is presented which exhibits both the details of traditional simulations and the low-overhead of hardware counters. Like the latter approach, it is also targeted towards an implementation within the processor for a direct and nonintrusive access to caches and memory busses. Unlike traditional counters, however, it delivers a detailed picture of the complete memory access behavior of applications. This is achieved by generating so-called memory access histograms, which show access frequencies in relation to the applications address space. Such spatial memory access information can then be used for efficient program optimization by focusing on the code and data segments which were found to exhibit a poor cache behavior.
AB - The analysis of the memory access behavior of applications, an essential step for a successful cache optimization, is a complex task. It needs to be supported with appropriate tools and monitoring facilities. Currently, however, users can only rely on either simulation based approaches, which deliver a large degree of detail but are restricted in their applicability, or on hardware counters embedded into processors, which allow to keep track of very few, mostly global events and hence only provide limited data. In this work a proposal for novel hardware monitoring facility is presented which exhibits both the details of traditional simulations and the low-overhead of hardware counters. Like the latter approach, it is also targeted towards an implementation within the processor for a direct and nonintrusive access to caches and memory busses. Unlike traditional counters, however, it delivers a detailed picture of the complete memory access behavior of applications. This is achieved by generating so-called memory access histograms, which show access frequencies in relation to the applications address space. Such spatial memory access information can then be used for efficient program optimization by focusing on the code and data segments which were found to exhibit a poor cache behavior.
UR - http://www.scopus.com/inward/record.url?scp=33645490410&partnerID=8YFLogxK
U2 - 10.1145/773146.773047
DO - 10.1145/773146.773047
M3 - Conference contribution
AN - SCOPUS:33645490410
T3 - Proceedings of the 2002 Workshop on Memory System Performance, MSP 2002
SP - 76
EP - 85
BT - Proceedings of the 2002 Workshop on Memory System Performance, MSP 2002
PB - Association for Computing Machinery, Inc
T2 - 2002 Workshop on Memory System Performance, MSP 2002
Y2 - 16 June 2002
ER -