Parallel Golomb-Rice Decoder with 8-bit Unary Decoding for Weight Compression in TinyML Applications

Mounika Vaddeboina, Endri Kaja, Alper Yilmayer, Sebastian Prebeck, Wolfgang Ecker

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

1 Zitat (Scopus)

Abstract

Due to the recent advances in AI, the requirement for Artificial Intelligence (AI) has increased exponentially in the domain of Internet of Things (IoT). Running Deep Neural Networks (DNNs) on edge devices gives the advantage of privacy, security, and lower latency. It is challenging to deploy them on embedded devices with constrained hardware resources since a lot of compute and memory resources are required. Memory access contributes to the majority of the energy requirements on edge devices. Although data compression plays a critical role in reducing storage and memory bandwidth requirements, most of the hardware decoders are inefficient in terms of power, area, and throughput. In this work, a hardware Parallel Golomb-Rice decoder is presented that can decode 8-bits of unary encoded data every cycle. The design has been integrated with a Neural Network (NN) accelerator and experimented with state-of-the-art benchmark models. Lossless compression is performed with an offline Golomb-Rice encoder. It encodes the weights of each layer with an optimum Golomb-Rice parameter. Applied to the benchmarks Anomaly Detection, Image Classification and Visual Wake Words the memory access during inference is reduced by 26.8%, 6.62% and 5.54% respectively. The decoder dissipates 0.4216 mW of power and delivers an average throughput of 860 MBps. The design has been synthesised with 40 nm technology and compared with state-of-the-art works.

OriginalspracheEnglisch
TitelProceedings - 2023 26th Euromicro Conference on Digital System Design, DSD 2023
Redakteure/-innenSmail Niar, Hamza Ouarnoughi, Amund Skavhaug
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers Inc.
Seiten227-234
Seitenumfang8
ISBN (elektronisch)9798350344196
DOIs
PublikationsstatusVeröffentlicht - 2023
Veranstaltung26th Euromicro Conference on Digital System Design, DSD 2023 - Golem, Albanien
Dauer: 6 Sept. 20238 Sept. 2023

Publikationsreihe

NameProceedings - 2023 26th Euromicro Conference on Digital System Design, DSD 2023

Konferenz

Konferenz26th Euromicro Conference on Digital System Design, DSD 2023
Land/GebietAlbanien
OrtGolem
Zeitraum6/09/238/09/23

Fingerprint

Untersuchen Sie die Forschungsthemen von „Parallel Golomb-Rice Decoder with 8-bit Unary Decoding for Weight Compression in TinyML Applications“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren