AVEC 2012 - The continuous audio/visual emotion challenge

Björn Schuller, Michel Valstar, Florian Eyben, Roddy Cowie, Maja Pantic

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

175 Scopus citations

Abstract

We present the second Audio-Visual Emotion recognition Challenge and workshop (AVEC 2012), which aims to bring together researchers from the audio and video analysis communities around the topic of emotion recognition. The goal of the challenge is to recognise four continuously valued affective dimensions: arousal, expectancy, power, and valence. There are two sub-challenges: in the Fully Continuous Sub- Challenge participants have to predict the values of the four dimensions at every moment during the recordings, while for the Word-Level Sub-Challenge a single prediction has to be given per word uttered by the user. This paper presents the challenge guidelines, the common data used, and the performance of the baseline system on the two tasks.

Original languageEnglish
Title of host publicationICMI'12 - Proceedings of the ACM International Conference on Multimodal Interaction
Pages449-456
Number of pages8
DOIs
StatePublished - 2012
Externally publishedYes
Event14th ACM International Conference on Multimodal Interaction, ICMI 2012 - Santa Monica, CA, United States
Duration: 22 Oct 201226 Oct 2012

Publication series

NameICMI'12 - Proceedings of the ACM International Conference on Multimodal Interaction

Conference

Conference14th ACM International Conference on Multimodal Interaction, ICMI 2012
Country/TerritoryUnited States
CitySanta Monica, CA
Period22/10/1226/10/12

Keywords

  • Affective computing
  • Challenge
  • Emotion recognition
  • Facial expression
  • Speech

Fingerprint

Dive into the research topics of 'AVEC 2012 - The continuous audio/visual emotion challenge'. Together they form a unique fingerprint.

Cite this