TY - GEN
T1 - PaGoRi:A Scalable Parallel Golomb-Rice Decoder
AU - Vaddeboina, Mounika
AU - Kaja, Endri
AU - Yilmazer, Alper
AU - Ghosh, Uttal
AU - Ecker, Wolfgang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Deep Neural Networks (DNNs) have created opportunities to address real-world issues and expand the application of Artificial Intelligence (AI). Despite significant accuracy enhancements, DNNs pose a challenge when deployed on resource-limited edge devices commonly used in Internet of Things (IoT) applications. Inference execution of the DNNs requires accessing millions of parameters responsible for most energy consumption. Compression of weights is one possible solution, but most of the existing hardware decompression units could be more efficient in terms of power, area, and energy. This paper presents a scalable version of a hardware-efficient Parallel Golomb-Rice decoder (PaGoRi). The decoder has been integrated with an industry-strength Neural Network (NN) accelerator and evaluated with three TinyML benchmarks. The PaGoRi decoder achieves optimal trade-offs between power consumption and throughput, supporting decoding capacities of four and eight weights, consuming 0.43 mW and 0.79 mW of power, respectively, while achieving a throughput of 888 MBps and 1.3 GBps, respectively.
AB - Deep Neural Networks (DNNs) have created opportunities to address real-world issues and expand the application of Artificial Intelligence (AI). Despite significant accuracy enhancements, DNNs pose a challenge when deployed on resource-limited edge devices commonly used in Internet of Things (IoT) applications. Inference execution of the DNNs requires accessing millions of parameters responsible for most energy consumption. Compression of weights is one possible solution, but most of the existing hardware decompression units could be more efficient in terms of power, area, and energy. This paper presents a scalable version of a hardware-efficient Parallel Golomb-Rice decoder (PaGoRi). The decoder has been integrated with an industry-strength Neural Network (NN) accelerator and evaluated with three TinyML benchmarks. The PaGoRi decoder achieves optimal trade-offs between power consumption and throughput, supporting decoding capacities of four and eight weights, consuming 0.43 mW and 0.79 mW of power, respectively, while achieving a throughput of 888 MBps and 1.3 GBps, respectively.
KW - Deep Neural Networks
KW - Golomb-Rice coding
KW - Internet-of-Things
KW - Neural Network accelerators
KW - Parallel decoders
UR - http://www.scopus.com/inward/record.url?scp=85192847406&partnerID=8YFLogxK
U2 - 10.1109/DDECS60919.2024.10508926
DO - 10.1109/DDECS60919.2024.10508926
M3 - Conference contribution
AN - SCOPUS:85192847406
T3 - Proceedings - 2024 27th International Symposium on Design and Diagnostics of Electronic Circuits and Systems, DDECS 2024
SP - 67
EP - 72
BT - Proceedings - 2024 27th International Symposium on Design and Diagnostics of Electronic Circuits and Systems, DDECS 2024
A2 - Deniziak, Stanislaw
A2 - Sitek, Pawel
A2 - Jenihhin, Maksim
A2 - Steininger, Andreas
A2 - Scholzel, Mario
A2 - Mrazek, Vojtech
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 27th International Symposium on Design and Diagnostics of Electronic Circuits and Sytems, DDECS 2024
Y2 - 3 April 2024 through 5 April 2024
ER -