Optimizing Data Compression: Enhanced Golomb-Rice Encoding with Parallel Decoding Strategies for TinyML Models

Mounika Vaddeboina, Alper Yilmayer, Wolfgang Ecker

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Deep Neural Networks (DNNs) offer possibilities for tackling practical challenges and broadening the scope of Artificial Intelligence (AI) applications. The demanding memory requirements of present-day neural networks can be attributed to the rising intricacy of network architectures. These designs encompass multiple layers with an extensive number of parameters, leading to heightened demands on memory storage. The energy consumption during the inference execution of DNNs is predominantly attributed to the access and processing of these parameters. To tackle the significant size of models integrated into Internet of Things (IoT) devices, a promising strategy involves diminishing the bit width of weights. This paper introduces an improved version of Golomb-Rice (GR) encoder and an optimized Parallel Golomb-Rice decoder that can support sparse and non-sparse DNNs. To evaluate the encoder's and decoder's efficiency, we conducted two sets of experiments using three TinyML benchmarks, one without pruning and the other incorporating pruning. The results highlight that the encoder demonstrates a Compression-Ratio (CR) superior to that of Huffman encoding, and the decoder exhibits an energy efficiency of up to 2.6 TBps/W and 2.7 TBps/W for four- and eight-weight decoding, respectively.

Original languageEnglish
Title of host publicationProceedings - 2024 27th Euromicro Conference on Digital System Design, DSD 2024
EditorsTomasz Kryjak, Frederic Petrot
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages487-494
Number of pages8
ISBN (Electronic)9798350380385
DOIs
StatePublished - 2024
Event27th Euromicro Conference on Digital System Design, DSD 2024 - Paris, France
Duration: 28 Aug 202430 Aug 2024

Publication series

NameProceedings - 2024 27th Euromicro Conference on Digital System Design, DSD 2024

Conference

Conference27th Euromicro Conference on Digital System Design, DSD 2024
Country/TerritoryFrance
CityParis
Period28/08/2430/08/24

Keywords

  • Deep Neural Networks
  • Golomb-Rice coding
  • Internet-of-Things
  • Neural Network accelerators
  • Parallel decoders

Fingerprint

Dive into the research topics of 'Optimizing Data Compression: Enhanced Golomb-Rice Encoding with Parallel Decoding Strategies for TinyML Models'. Together they form a unique fingerprint.

Cite this