Compact Bilinear Deep Features For Environmental Sound Recognition

Fatih Demir, Abdulkadir Sengur, Hao Lu, Shahin Amiriparian, Nicholas Cummins, Björn Schuller

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

6 Scopus citations

Abstract

Environmental sound recognition (ESR) has extensive civilian and military applications. Existing ESR methods generally tackle this problem by employing various signal processing and machine learning methods. Herein, an ESR paradigm is presented, based on feature extraction from pre-trained deep convolutional neural networks (CNNs), the derivation of higher-order statistics by compact bilinear pooling, and normalisation. In particular, we consider two deep ImageNet architectures for deep feature extraction, and the Random Maclaurin (RM) projection to produce the compact bilinear features. A support vector machine (SVM) with homogeneous mapping is used in the classification stage. Two publicly available environmental sound datasets, namely ESC-50 and ESC-10, are used to verify the efficacy of the approach. We compare the proposed method with various previous state-of-the-art methods. The presented results indicate the suitability of the higher-order statistics of Deep Spectrum representations for ESR classification tasks.
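
To make the compact bilinear pooling step in the abstract more concrete, the following is a minimal NumPy sketch of a Random Maclaurin projection for second-order (bilinear) features. It is not the authors' implementation. The function names, the output dimension `d`, the random seed, and the signed square-root plus L2 normalisation are assumptions chosen for illustration, since the abstract does not specify them. The input is assumed to be a matrix of local CNN descriptors, one row per spatial location.

```python
import numpy as np

def random_maclaurin_projection(c, d, seed=0):
    """Draw two random sign (Rademacher, +1/-1) matrices of shape (d, c).

    c: channel dimension of the local CNN descriptors (assumed input size)
    d: dimension of the compact bilinear feature (assumed, user-chosen)
    """
    rng = np.random.default_rng(seed)
    w1 = rng.choice([-1.0, 1.0], size=(d, c))
    w2 = rng.choice([-1.0, 1.0], size=(d, c))
    return w1, w2

def compact_bilinear_rm(features, w1, w2):
    """Approximate bilinear pooling of local descriptors.

    features: array of shape (n_locations, c)
    Returns a vector of length d.
    """
    d = w1.shape[0]
    # Random Maclaurin map for the second-order polynomial kernel:
    # phi(x) = (W1 x) * (W2 x) / sqrt(d), elementwise product.
    proj = (features @ w1.T) * (features @ w2.T) / np.sqrt(d)
    # Sum-pool over all spatial locations.
    pooled = proj.sum(axis=0)
    # Assumed normalisation: signed square root, then L2.
    pooled = np.sign(pooled) * np.sqrt(np.abs(pooled))
    norm = np.linalg.norm(pooled)
    return pooled / norm if norm > 0 else pooled
```

In expectation, the inner product of two unnormalised projections equals the squared inner product of the original descriptors, which is why a vector of length `d` can stand in for the full `c × c` bilinear feature. The resulting vector would then be passed to the classifier stage.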

Original language: English
Title of host publication: 2018 International Conference on Artificial Intelligence and Data Processing, IDAP 2018
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781538668788
State: Published - 21 Jan 2019
Externally published: Yes
Event: 2018 International Conference on Artificial Intelligence and Data Processing, IDAP 2018 - Malatya, Turkey
Duration: 28 Sep 2018 to 30 Sep 2018

Publication series

Name: 2018 International Conference on Artificial Intelligence and Data Processing, IDAP 2018

Conference

Conference: 2018 International Conference on Artificial Intelligence and Data Processing, IDAP 2018
Country/Territory: Turkey
City: Malatya
Period: 28/09/18 to 30/09/18

Keywords

  • Environmental sound classification
  • compact bilinear pooling
  • convolutional neural networks
  • deep spectrum features
