TY - JOUR
T1 - Recognising realistic emotions and affect in speech
T2 - State of the art and lessons learnt from the first challenge
AU - Schuller, Björn
AU - Batliner, Anton
AU - Steidl, Stefan
AU - Seppi, Dino
PY - 2011/11
Y1 - 2011/11
N2 - More than a decade has passed since research on automatic recognition of emotion from speech has become a new field of research in line with its 'big brothers' speech and speaker recognition. This article attempts to provide a short overview on where we are today, how we got there and what this can reveal us on where to go next and how we could arrive there. In a first part, we address the basic phenomenon reflecting the last fifteen years, commenting on databases, modelling and annotation, the unit of analysis and prototypicality. We then shift to automatic processing including discussions on features, classification, robustness, evaluation, and implementation and system integration. From there we go to the first comparative challenge on emotion recognition from speech - the INTERSPEECH 2009 Emotion Challenge, organised by (part of) the authors, including the description of the Challenge's database, Sub-Challenges, participants and their approaches, the winners, and the fusion of results to the actual learnt lessons before we finally address the ever-lasting problems and future promising attempts.
AB - More than a decade has passed since research on automatic recognition of emotion from speech has become a new field of research in line with its 'big brothers' speech and speaker recognition. This article attempts to provide a short overview on where we are today, how we got there and what this can reveal us on where to go next and how we could arrive there. In a first part, we address the basic phenomenon reflecting the last fifteen years, commenting on databases, modelling and annotation, the unit of analysis and prototypicality. We then shift to automatic processing including discussions on features, classification, robustness, evaluation, and implementation and system integration. From there we go to the first comparative challenge on emotion recognition from speech - the INTERSPEECH 2009 Emotion Challenge, organised by (part of) the authors, including the description of the Challenge's database, Sub-Challenges, participants and their approaches, the winners, and the fusion of results to the actual learnt lessons before we finally address the ever-lasting problems and future promising attempts.
KW - Adaptation
KW - Affect
KW - Automatic classification
KW - Emotion
KW - Evaluation
KW - Feature selection
KW - Feature types
KW - Noise robustness
KW - Standardisation
KW - Usability
UR - http://www.scopus.com/inward/record.url?scp=79960846940&partnerID=8YFLogxK
U2 - 10.1016/j.specom.2011.01.011
DO - 10.1016/j.specom.2011.01.011
M3 - Article
AN - SCOPUS:79960846940
SN - 0167-6393
VL - 53
SP - 1062
EP - 1087
JO - Speech Communication
JF - Speech Communication
IS - 9-10
ER -