Skip to main navigation Skip to search Skip to main content

Enabling Efficient Inference of Convolutional Neural Networks via Approximation

  • Georgios Zervakis
  • , Iraklis Anagnostopoulos
  • , Hussam Amrouch
  • , Jörg Henkel
  • Humanoid Technologies Lab (H2T)
  • Southern Illinois University Carbondale
  • Universität Stuttgart

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

Abstract

With the rapid advancements in the Artificial Intelligence area, Neural Networks (NNs) became the driving force both in general purpose and embedded computing domains. Especially, resource constrained embedded systems progressively rely on multiple NNs to provide on the spot sophisticated services. Nevertheless, supporting NN-based workloads is challenging due to the enormous computational and energy requirements. Exploiting the inherent error resiliency of NNs, significant research focuses on designing approximate Convolutional NN (CNN) inference accelerators, demonstrating that, for negligible accuracy loss, they satisfy tight latency, power, and temperature constraints. This chapter provides a comprehensive discussion of different aspects of approximate CNN implementations.

Original languageEnglish
Title of host publicationApproximate Computing
PublisherSpringer International Publishing
Pages429-450
Number of pages22
ISBN (Electronic)9783030983475
ISBN (Print)9783030983468
DOIs
StatePublished - 1 Jan 2022
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Approximate computing
  • Approximate multiplier
  • Circuit aging
  • Control variate
  • Convolution
  • Convolutional neural network (CNN)
  • Deep neural network (DNN)
  • Energy efficiency
  • Error analysis
  • Error correction
  • Error estimation
  • Inference
  • Low-power
  • Neural Processing Unit (NPU)
  • Quantization
  • Reconfigurable approximation
  • Reliability-aware approximation
  • Software-hardware codesign
  • Temperature-aware approximation
  • Thermal management

Fingerprint

Dive into the research topics of 'Enabling Efficient Inference of Convolutional Neural Networks via Approximation'. Together they form a unique fingerprint.

Cite this