TY - CHAP
T1 - Emotion Recognition in Naturalistic Speech and Language-A Survey
AU - Weninger, Felix
AU - Wöllmer, Martin
AU - Schuller, Björn
N1 - Publisher Copyright:
© 2015 John Wiley & Sons, Inc. All rights reserved.
PY - 2015/1/2
Y1 - 2015/1/2
N2 - This chapter provides an overview over recent developments in naturalistic emotion recognition based on acoustic and linguistic cues. It discusses a variety of use-cases where emotion recognition can improve quality of service and quality of life. The chapter describes the existing corpora of emotional speech data relating to such scenarios, the underlying theory of emotion modeling, and the need for an optimal unit of analysis. It focuses on the challenges for real-life applications that have become evident: non-prototypicality; lack of solid ground truth and data sparsity; generalization across application scenarios, languages, and cultures; requirements of real-time and incremental processing; robustness with respect to acoustic conditions; and appropriate evaluation measures that reflect real-life settings. The chapter concludes by giving further directions for the field, including novel strategies to augment training data by synthesis and (semi-)unsupervised learning, as well as joint learning of other paralinguistic features by mutual information exploitation.
AB - This chapter provides an overview over recent developments in naturalistic emotion recognition based on acoustic and linguistic cues. It discusses a variety of use-cases where emotion recognition can improve quality of service and quality of life. The chapter describes the existing corpora of emotional speech data relating to such scenarios, the underlying theory of emotion modeling, and the need for an optimal unit of analysis. It focuses on the challenges for real-life applications that have become evident: non-prototypicality; lack of solid ground truth and data sparsity; generalization across application scenarios, languages, and cultures; requirements of real-time and incremental processing; robustness with respect to acoustic conditions; and appropriate evaluation measures that reflect real-life settings. The chapter concludes by giving further directions for the field, including novel strategies to augment training data by synthesis and (semi-)unsupervised learning, as well as joint learning of other paralinguistic features by mutual information exploitation.
KW - Emotional speech data
KW - Naturalistic emotion recognition
KW - Quality of life
KW - Real-time processing
KW - Unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85016363297&partnerID=8YFLogxK
U2 - 10.1002/9781118910566.ch10
DO - 10.1002/9781118910566.ch10
M3 - Chapter
AN - SCOPUS:85016363297
SN - 9781118130667
SP - 237
EP - 267
BT - Emotion Recognition
PB - wiley
ER -