Fully automatic audiovisual emotion recognition: Voice, words, and the face

Martin Wöllmer, Moritz Kaiser, Florian Eyben, Felix Weninger, Björn Schuller, Gerhard Rigoll

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

Abstract

The recognition of human emotions from spontaneous and non-prototypical real-life data is currently one of the most challenging tasks in the field of affective computing. This contribution presents our recent advances in assessing dimensional representations of emotion, such as arousal, expectation, power, and valence, in an audiovisual human-computer interaction scenario. We propose a fully automatic multimodal recognition approach based on context-sensitive modeling of audio and video features. Evaluations on the Audiovisual Sub-Challenge of the 2011 Audio/Visual Emotion Challenge show how accurately different affective dimensions can be recognized. Our experiments reveal that the proposed multimodal recognition system outperforms previously introduced techniques evaluated on the same task.

OriginalspracheEnglisch
TitelSprachkommunikation - 10. ITG-Fachtagung
Herausgeber (Verlag)VDE VERLAG GMBH
Seiten71-74
Seitenumfang4
ISBN (elektronisch)9783800734559
PublikationsstatusVeröffentlicht - 2020
Veranstaltung10. ITG-Fachtagung Sprachkommunikation - 10th ITG Conference on Speech Communication - Braunschweig, Deutschland
Dauer: 26 Sept. 201228 Sept. 2012

Publikationsreihe

NameSprachkommunikation - 10. ITG-Fachtagung

Konferenz

Konferenz10. ITG-Fachtagung Sprachkommunikation - 10th ITG Conference on Speech Communication
Land/GebietDeutschland
OrtBraunschweig
Zeitraum26/09/1228/09/12

Fingerprint

Untersuchen Sie die Forschungsthemen von „Fully automatic audiovisual emotion recognition: Voice, words, and the face“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren