A CNN-GRU approach to capture time-frequency pattern interdependence for snore sound classification

Jianhong Wang, Harald Strömfelt, Björn W. Schuller

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Scopus citations

Abstract

In this work, we propose an architecture named DualConvGRU Network to overcome the INTERPEECH 2017 ComParE Snoring sub-challenge. In this network, we devise two new models: the Dual Convolutional Layer, which is applied to a spectrogram to extract features; and the Channel Slice Model, which reprocess the extracted features. The first amalgamates an ensemble of information collected from two types of convolutional operations, with differing kernel dimension on the frequency axis and equal dimension on the time axis. Secondly, the dependencies within the convolutional layer channel axes are learnt, by feeding channel slices into a Gated Recurrent Unit (GRU) layer. By taking this approach, convolutional layers can be connected to sequential models without the use of fully connected layers. Compared with other state-of-the-art methods delivered to INTERPEECH 2017 ComParE Snoring sub-challenge, our method ranks 5th on performance of test data. Moreover, we are the only competitor to train a deep learning model solely on the provided training data, except for Baseline. The performance of our model exceeds the baseline too much.

Original languageEnglish
Title of host publication2018 26th European Signal Processing Conference, EUSIPCO 2018
PublisherEuropean Signal Processing Conference, EUSIPCO
Pages997-1001
Number of pages5
ISBN (Electronic)9789082797015
DOIs
StatePublished - 29 Nov 2018
Externally publishedYes
Event26th European Signal Processing Conference, EUSIPCO 2018 - Rome, Italy
Duration: 3 Sep 20187 Sep 2018

Publication series

NameEuropean Signal Processing Conference
Volume2018-September
ISSN (Print)2219-5491

Conference

Conference26th European Signal Processing Conference, EUSIPCO 2018
Country/TerritoryItaly
CityRome
Period3/09/187/09/18

Keywords

  • Channel Slice Model
  • Dual Convolutional Layers
  • DualConvGRU Network

Fingerprint

Dive into the research topics of 'A CNN-GRU approach to capture time-frequency pattern interdependence for snore sound classification'. Together they form a unique fingerprint.

Cite this