TY - GEN
T1 - Acoustic emotion recognition
T2 - 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009
AU - Schuller, Björn
AU - Vlasenko, Bogdan
AU - Eyben, Florian
AU - Rigoll, Gerhard
AU - Wendemuth, Andreas
PY - 2009
Y1 - 2009
N2 - In light of the first challenge on emotion recognition from speech, we provide the largest-to-date benchmark comparison under equal conditions on nine standard corpora in the field using the two predominant paradigms: modeling on a frame level by means of Hidden Markov Models and supra-segmental modeling by systematic feature brute-forcing. The investigated corpora are the ABC, AVIC, DES, EMO-DB, eNTERFACE, SAL, SmartKom, SUSAS, and VAM databases. To provide better comparability among sets, we additionally cluster each database's emotions into binary valence and arousal discrimination tasks. The results show large differences among corpora, which mostly stem from naturalistic emotions and spontaneous speech vs. more prototypical events. Further, supra-segmental modeling proves significantly beneficial on average when several classes are addressed at a time.
AB - In light of the first challenge on emotion recognition from speech, we provide the largest-to-date benchmark comparison under equal conditions on nine standard corpora in the field using the two predominant paradigms: modeling on a frame level by means of Hidden Markov Models and supra-segmental modeling by systematic feature brute-forcing. The investigated corpora are the ABC, AVIC, DES, EMO-DB, eNTERFACE, SAL, SmartKom, SUSAS, and VAM databases. To provide better comparability among sets, we additionally cluster each database's emotions into binary valence and arousal discrimination tasks. The results show large differences among corpora, which mostly stem from naturalistic emotions and spontaneous speech vs. more prototypical events. Further, supra-segmental modeling proves significantly beneficial on average when several classes are addressed at a time.
UR - http://www.scopus.com/inward/record.url?scp=77949395673&partnerID=8YFLogxK
U2 - 10.1109/ASRU.2009.5372886
DO - 10.1109/ASRU.2009.5372886
M3 - Conference contribution
AN - SCOPUS:77949395673
SN - 9781424454792
T3 - Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009
SP - 552
EP - 557
BT - Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009
Y2 - 13 December 2009 through 17 December 2009
ER -