End-to-End Spiking Neural Network for Speech Recognition Using Resonating Input Neurons

Daniel Auge, Julian Hille, Felix Kreutz, Etienne Mueller, Alois Knoll

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

10 Scopus citations

Abstract

The growing demand for complex computations in edge devices requires the development of algorithms and hardware accelerators that are powerful while remaining energy-efficient. One possible solution is spiking neural networks, which have been demonstrated to be energy-efficient in several data processing and classification tasks when executed on specialized neuromorphic hardware. In the field of speech processing, they are especially suited to the online classification of audio streams due to their strong temporal affinity. So far, however, little emphasis has been placed on small-scale networks that will ultimately fit into restricted neuromorphic implementations. We propose the use of resonating neurons as an input layer to spiking neural networks for online audio classification to enable an end-to-end solution. We compare different architectures to the established method of using mel-frequency-based spectral features. With our approach, spiking neural networks can be used directly without additional preprocessing, making them suitable for simple, continuous, low-power analysis of audio streams. We compare the classification accuracy of different network architectures with ours in a keyword spotting benchmark to demonstrate the performance of our approach.
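
The idea of a resonating input neuron acting as a frequency-selective front end can be illustrated with a minimal sketch. The abstract does not specify the neuron model, so everything below is an assumption: a generic resonate-and-fire-style damped oscillator with a complex state, a hypothetical function name `resonator_spike_count`, illustrative parameter values, and a simplified spike rule (upward threshold crossing without a state reset). It is not the authors' implementation.

```python
import cmath
import math


def resonator_spike_count(input_hz, resonant_hz, duration_s=0.5, dt=1e-4,
                          damping=20.0, threshold=0.01, amplitude=1.0):
    """Count spikes of one damped-oscillator neuron driven by a sine tone.

    Assumed dynamics: dz/dt = (-damping + 1j*omega) * z + input(t).
    All default values are illustrative, not taken from the paper.
    """
    omega = 2.0 * math.pi * resonant_hz
    # Decay and rotation of the undriven oscillator over one time step.
    step = cmath.exp(complex(-damping, omega) * dt)
    z = 0j
    spikes = 0
    was_above = False
    for n in range(int(duration_s / dt)):
        drive = amplitude * math.sin(2.0 * math.pi * input_hz * n * dt)
        z = z * step + drive * dt
        # Simplified spike rule: upward crossing of the threshold by Im(z).
        is_above = z.imag > threshold
        if is_above and not was_above:
            spikes += 1
        was_above = is_above
    return spikes


if __name__ == "__main__":
    for tone_hz in (220.0, 440.0, 880.0):
        count = resonator_spike_count(tone_hz, resonant_hz=440.0)
        print(f"{tone_hz:6.1f} Hz tone -> {count} spikes")
```

Under these assumptions, a neuron tuned to 440 Hz spikes repeatedly for a 440 Hz tone and stays silent for tones an octave away, so a bank of such neurons with different resonant frequencies would turn a raw waveform into frequency-specific spike trains without a separate spectral preprocessing stage.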

Original language: English
Title of host publication: Artificial Neural Networks and Machine Learning – ICANN 2021 - 30th International Conference on Artificial Neural Networks, Proceedings
Editors: Igor Farkaš, Paolo Masulli, Sebastian Otte, Stefan Wermter
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 245-256
Number of pages: 12
ISBN (Print): 9783030863821
State: Published - 2021
Event: 30th International Conference on Artificial Neural Networks, ICANN 2021 - Virtual, Online
Duration: 14 Sep 2021 to 17 Sep 2021

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 12895 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 30th International Conference on Artificial Neural Networks, ICANN 2021
City: Virtual, Online
Period: 14/09/21 to 17/09/21

Keywords

  • Keyword detection
  • Speech processing
  • Spiking neural networks
