Improving generalisation and robustness of acoustic affect recognition

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Emotion recognition in real-life conditions faces several challenging factors, which most studies on emotion recognition do not consider. Such factors include background noise, varying recording levels, and acoustic properties of the environment, for example. This paper presents a systematic evaluation of the influence of background noise of various types and SNRs, as well as recording level variations on the performance of automatic emotion recognition from speech. Both, natural and spontaneous as well as acted/prototypical emotions are considered. Besides the well known influence of additive noise, a significant influence of the recording level on the recognition performance is observed. Multi-condition learning with various noise types and recording levels is proposed as a way to increase robustness of methods based on standard acoustic feature sets and commonly used classifiers. It is compared to matched conditions learning and is found to be almost on par for many settings.

Original languageEnglish
Title of host publicationICMI'12 - Proceedings of the ACM International Conference on Multimodal Interaction
Pages517-521
Number of pages5
DOIs
StatePublished - 2012
Event14th ACM International Conference on Multimodal Interaction, ICMI 2012 - Santa Monica, CA, United States
Duration: 22 Oct 201226 Oct 2012

Publication series

NameICMI'12 - Proceedings of the ACM International Conference on Multimodal Interaction

Conference

Conference14th ACM International Conference on Multimodal Interaction, ICMI 2012
Country/TerritoryUnited States
CitySanta Monica, CA
Period22/10/1226/10/12

Keywords

  • Emotion recognition
  • Multicondition training
  • Noise robustness
  • Recording level

Fingerprint

Dive into the research topics of 'Improving generalisation and robustness of acoustic affect recognition'. Together they form a unique fingerprint.

Cite this