Resource-aware optimization of DNNs for embedded applications

Alexander Frickenstein, Christian Unger, Walter Stechele

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Despite their outstanding success in solving complex computer vision problems, Deep Neural Networks (DNNs) still require high-performance hardware for real-time inference. Therefore they are not applicable to low-cost embedded hardware, where memory resources, computational performance and power consumption are restricted. Furthermore, current approaches of fitting neural networks to embedded hardware are time consuming, inducing slow development cycles. To address these drawbacks and satisfy the demands of embedded hardware, this paper proposes a computationally efficient magnitude-based pruning scheme, based on a half-interval search, combined with effective weight sharing, fixed-point quantization, and lossless compression. The proposed solution can be utilized to generate an optimized model, either with respect to memory demand or execution time. For instance, the memory demand of LeNet is compressed about 385×. VGG16 is pruned by about 14.5×, whilst its computational costs are reduced by about 1.6× for a CPU-based application and 4.8× for an FPGA one.

Original languageEnglish
Title of host publicationProceedings - 2019 16th Conference on Computer and Robot Vision, CRV 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages17-24
Number of pages8
ISBN (Electronic)9781728118383
DOIs
StatePublished - May 2019
Event16th Conference on Computer and Robot Vision, CRV 2019 - Kingston, Canada
Duration: 29 May 201931 May 2019

Publication series

NameProceedings - 2019 16th Conference on Computer and Robot Vision, CRV 2019

Conference

Conference16th Conference on Computer and Robot Vision, CRV 2019
Country/TerritoryCanada
CityKingston
Period29/05/1931/05/19

Keywords

  • CNN
  • Coding
  • Embedded HW
  • Pruning
  • Quantization
  • Weight-Sharing

Fingerprint

Dive into the research topics of 'Resource-aware optimization of DNNs for embedded applications'. Together they form a unique fingerprint.

Cite this