TY - GEN
T1 - On the Effectiveness of Quantization and Pruning on the Performance of FPGAs-based NN Temperature Estimation
AU - Krishna Rao Muvva, Veera Venkata Ram Murali
AU - Rapp, Martin
AU - Henkel, Joerg
AU - Amrouch, Hussam
AU - Wolf, Marilyn
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/8/30
Y1 - 2021/8/30
N2 - A well-functioning thermal management system on the chip requires knowledge of the current temperature and the potential changes in temperature in the near future. This information is important for ensuring proactive thermal management on the chip. However, the limited number of sensors on the chip makes it difficult to accomplish this task. Hence we proposed a neural network based approach to predict the temperature map of the chip. To solve the problem, we have implemented two different neural networks, one is a feedforward network and the other uses recurrent neural networks. Our proposed method requires only performance counters measure to predict the temperature map of the chip during the runtime. Each of the two models shows promising results regarding the estimation of the temperature map on the chip. The recurrent neural network outperformed the feedforward neural network. Furthermore, both networks have been quantized, pruned, and the feedforward network has been compiled into FPGA logic. Therefore, the network could be embedded in the chip, whether it be an ASIC or an FPGA.
AB - A well-functioning thermal management system on the chip requires knowledge of the current temperature and the potential changes in temperature in the near future. This information is important for ensuring proactive thermal management on the chip. However, the limited number of sensors on the chip makes it difficult to accomplish this task. Hence we proposed a neural network based approach to predict the temperature map of the chip. To solve the problem, we have implemented two different neural networks, one is a feedforward network and the other uses recurrent neural networks. Our proposed method requires only performance counters measure to predict the temperature map of the chip during the runtime. Each of the two models shows promising results regarding the estimation of the temperature map on the chip. The recurrent neural network outperformed the feedforward neural network. Furthermore, both networks have been quantized, pruned, and the feedforward network has been compiled into FPGA logic. Therefore, the network could be embedded in the chip, whether it be an ASIC or an FPGA.
KW - Deep Learning
KW - FPGA Compilation
KW - Neural Network
KW - Quantization
KW - Thermal Management
UR - http://www.scopus.com/inward/record.url?scp=85115677115&partnerID=8YFLogxK
U2 - 10.1109/MLCAD52597.2021.9531256
DO - 10.1109/MLCAD52597.2021.9531256
M3 - Conference contribution
AN - SCOPUS:85115677115
T3 - 2021 ACM/IEEE 3rd Workshop on Machine Learning for CAD, MLCAD 2021
BT - 2021 ACM/IEEE 3rd Workshop on Machine Learning for CAD, MLCAD 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd ACM/IEEE Workshop on Machine Learning for CAD, MLCAD 2021
Y2 - 30 August 2021 through 3 September 2021
ER -