TY - GEN
T1 - GLS Tracing
T2 - 8th IEEE Nordic Circuits and Systems Conference, NORCAS 2022
AU - Nolte, Lars
AU - Twardzik, Tim
AU - Jalier, Camille
AU - Huang, Zhigang
AU - Shi, Jiyuan
AU - Wild, Thomas
AU - Herkersdorf, Andreas
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Profiling and tracing mechanisms are vital to identify performance bottlenecks for the optimization of modern multi-threaded applications [1]. While profiling generates a summary of averaged performance metrics, tracing provides a full record (trace) of timestamped events [1]. Hence, tracing is mandatory for detailed timing analysis but is usually difficult to implement. First, the generated traces can get large which usually results in a high overhead to record them. Second, lossless traces (no events get lost) with precise timestamps for correct event ordering across large multi-core systems are challenges for common tracing mechanisms. As a complementary approach to existing tracing mechanisms, we introduce GLS Tracing (Gem5-based Low-intrusive Software Tracing), which overcomes the mentioned challenges. GLS Tracing combines the flexibility and fine granular resolution of software tracepoints (TP) with the low-intrusive system analysis capabilities of the Gem5 simulator. Using new Gem5 pseudo-instructions (Trace Point Operation, TPOP), the time-consuming recording of a native TP in software tracing is offloaded to the Gem5 Full-System simulator, thus reducing the number of CPU cycles to be simulated for this tracing. This new tracing capability is integrated into the native Linux kernel software TP infrastructure TRACE_EVENT, resulting in an average overhead reduction of 93% per recorded TP (from on average 2300 clock cycles down to 160 clock cycles). In addition, a less flexible but almost non-intrusive TP is defined that adds only 5 to 10 cycles overhead per TP. This provides a lossless low-intrusive tracing solution for Gem5, which can be used in both user-space and kernel-space functions.
AB - Profiling and tracing mechanisms are vital to identify performance bottlenecks for the optimization of modern multi-threaded applications [1]. While profiling generates a summary of averaged performance metrics, tracing provides a full record (trace) of timestamped events [1]. Hence, tracing is mandatory for detailed timing analysis but is usually difficult to implement. First, the generated traces can get large which usually results in a high overhead to record them. Second, lossless traces (no events get lost) with precise timestamps for correct event ordering across large multi-core systems are challenges for common tracing mechanisms. As a complementary approach to existing tracing mechanisms, we introduce GLS Tracing (Gem5-based Low-intrusive Software Tracing), which overcomes the mentioned challenges. GLS Tracing combines the flexibility and fine granular resolution of software tracepoints (TP) with the low-intrusive system analysis capabilities of the Gem5 simulator. Using new Gem5 pseudo-instructions (Trace Point Operation, TPOP), the time-consuming recording of a native TP in software tracing is offloaded to the Gem5 Full-System simulator, thus reducing the number of CPU cycles to be simulated for this tracing. This new tracing capability is integrated into the native Linux kernel software TP infrastructure TRACE_EVENT, resulting in an average overhead reduction of 93% per recorded TP (from on average 2300 clock cycles down to 160 clock cycles). In addition, a less flexible but almost non-intrusive TP is defined that adds only 5 to 10 cycles overhead per TP. This provides a lossless low-intrusive tracing solution for Gem5, which can be used in both user-space and kernel-space functions.
KW - Gem5
KW - Linux
KW - Tracing
KW - kernel-space
KW - user-space
UR - http://www.scopus.com/inward/record.url?scp=85142433392&partnerID=8YFLogxK
U2 - 10.1109/NorCAS57515.2022.9934111
DO - 10.1109/NorCAS57515.2022.9934111
M3 - Conference contribution
AN - SCOPUS:85142433392
T3 - 2022 IEEE Nordic Circuits and Systems Conference, NORCAS 2022 - Proceedings
BT - 2022 IEEE Nordic Circuits and Systems Conference, NORCAS 2022 - Proceedings
A2 - Nurmi, Jari
A2 - Wisland, Dag T.
A2 - Aunet, Snorre
A2 - Kjelgaard, Kristian
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 25 October 2022 through 26 October 2022
ER -