Abstract
With the rapid advancements in the Artificial Intelligence area, Neural Networks (NNs) became the driving force both in general purpose and embedded computing domains. Especially, resource constrained embedded systems progressively rely on multiple NNs to provide on the spot sophisticated services. Nevertheless, supporting NN-based workloads is challenging due to the enormous computational and energy requirements. Exploiting the inherent error resiliency of NNs, significant research focuses on designing approximate Convolutional NN (CNN) inference accelerators, demonstrating that, for negligible accuracy loss, they satisfy tight latency, power, and temperature constraints. This chapter provides a comprehensive discussion of different aspects of approximate CNN implementations.
| Original language | English |
|---|---|
| Title of host publication | Approximate Computing |
| Publisher | Springer International Publishing |
| Pages | 429-450 |
| Number of pages | 22 |
| ISBN (Electronic) | 9783030983475 |
| ISBN (Print) | 9783030983468 |
| DOIs | |
| State | Published - 1 Jan 2022 |
| Externally published | Yes |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- Approximate computing
- Approximate multiplier
- Circuit aging
- Control variate
- Convolution
- Convolutional neural network (CNN)
- Deep neural network (DNN)
- Energy efficiency
- Error analysis
- Error correction
- Error estimation
- Inference
- Low-power
- Neural Processing Unit (NPU)
- Quantization
- Reconfigurable approximation
- Reliability-aware approximation
- Software-hardware codesign
- Temperature-aware approximation
- Thermal management
Fingerprint
Dive into the research topics of 'Enabling Efficient Inference of Convolutional Neural Networks via Approximation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver