Efficient Software-Implemented HW Fault Tolerance for TinyML Inference in Safety-critical Applications

Uzair Sharif, Daniel Mueller-Gritschneder, Rafael Stahl, Ulf Schlichtmann

Publication: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

5 Citations (Scopus)

Abstract

TinyML research has mainly focused on optimizing neural network inference in terms of latency, code size and energy use for efficient execution on low-power microcontroller units (MCUs). However, distinctive design challenges emerge in safety-critical applications, for example in small unmanned autonomous vehicles such as drones, because off-the-shelf MCU devices are susceptible to soft errors. We propose three new techniques that protect TinyML inference against random soft errors with the goal of reducing run-time overhead: one for protecting fully-connected layers; one adaptation of existing algorithmic fault tolerance techniques to depth-wise convolutions; and an efficient technique to protect the so-called epilogues within TinyML layers. Integrating these layer-wise methods, we derive a full-inference hardening solution for TinyML that achieves run-time-efficient soft-error resilience. We evaluate our proposed solution on MLPerf-Tiny benchmarks. Our experimental results show that resilience competitive with currently available methods can be achieved while reducing run-time overheads by 120% for one fully-connected neural network (NN); 20% for the two CNNs with depth-wise convolutions; and 2% for the standard CNN. Additionally, we propose selective hardening, which reduces the incurred run-time overhead by a further 2x for the studied CNNs by focusing exclusively on avoiding mispredictions.
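As an illustration of the general idea behind algorithmic fault tolerance for a fully-connected layer, the sketch below uses a classic column-checksum check on an integer matrix-vector product. It is not the technique proposed in the paper, whose details are not given in the abstract; the function names (`fc_layer`, `column_checksum`, `checked_fc_layer`) and the `fault` parameter used to simulate a soft error are hypothetical. It assumes integer arithmetic, as in quantized TinyML models, so the checksum comparison can be exact.

```python
def fc_layer(weights, x):
    """Integer fully-connected layer: y[i] = sum_j weights[i][j] * x[j]."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def column_checksum(weights):
    """Column sums of the weight matrix; can be precomputed once, offline."""
    return [sum(col) for col in zip(*weights)]


def checked_fc_layer(weights, checksum, x, fault=None):
    """Run the layer and verify it against the checksum.

    `fault` is a hypothetical (index, delta) pair that corrupts one output
    to simulate a soft error; it exists only for demonstration.
    Returns (outputs, ok), where ok is False if a mismatch was detected.
    """
    y = fc_layer(weights, x)
    if fault is not None:
        idx, delta = fault
        y[idx] += delta  # simulated soft error in one accumulator
    # The sum of all outputs must equal the checksum row applied to the input:
    # sum_i y[i] = sum_j (sum_i weights[i][j]) * x[j]
    expected = sum(c * v for c, v in zip(checksum, x))
    return y, sum(y) == expected


weights = [[1, 2], [3, 4], [5, 6]]
x = [7, 8]
checksum = column_checksum(weights)

print(checked_fc_layer(weights, checksum, x))                # fault-free run
print(checked_fc_layer(weights, checksum, x, fault=(1, 4)))  # corrupted run
```

The check costs one extra dot product per layer instead of a full re-execution, which is the kind of run-time saving such schemes aim for. A single checksum can miss multiple errors that cancel each other out.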

Original language: English
Title: 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (electronic): 9783981926378
Publication status: Published - 2023
Event: 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023 - Antwerp, Belgium
Duration: 17 Apr. 2023 to 19 Apr. 2023

Publication series

Name: Proceedings - Design, Automation and Test in Europe, DATE
Volume: 2023-April
ISSN (Print): 1530-1591

Conference

Conference: 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023
Country/Territory: Belgium
City: Antwerp
Period: 17/04/23 to 19/04/23
