Fully automatic audiovisual emotion recognition: Voice, words, and the face

Martin Wöllmer, Moritz Kaiser, Florian Eyben, Felix Weninger, Björn Schuller, Gerhard Rigoll

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The recognition of human emotions from spontaneous and non-prototypical real-life data is currently one of the most challenging tasks in the field of affective computing. This contribution presents our recent advances in assessing dimensional representations of emotion, such as arousal, expectation, power, and valence, in an audiovisual human-computer interaction scenario. We propose a fully automatic multimodal recognition approach based on context-sensitive modeling of audio and video features. Evaluations on the Audiovisual Sub-Challenge of the 2011 Audio/Visual Emotion Challenge show how accurately different affective dimensions can be recognized. Our experiments reveal that the proposed multimodal recognition system outperforms previously introduced techniques evaluated on the same task.

Original languageEnglish
Title of host publicationSprachkommunikation - 10. ITG-Fachtagung
PublisherVDE VERLAG GMBH
Pages71-74
Number of pages4
ISBN (Electronic)9783800734559
StatePublished - 2020
Event10. ITG-Fachtagung Sprachkommunikation - 10th ITG Conference on Speech Communication - Braunschweig, Germany
Duration: 26 Sep 201228 Sep 2012

Publication series

NameSprachkommunikation - 10. ITG-Fachtagung

Conference

Conference10. ITG-Fachtagung Sprachkommunikation - 10th ITG Conference on Speech Communication
Country/TerritoryGermany
CityBraunschweig
Period26/09/1228/09/12

Fingerprint

Dive into the research topics of 'Fully automatic audiovisual emotion recognition: Voice, words, and the face'. Together they form a unique fingerprint.

Cite this