TY - GEN
T1 - Unsupervised learning in cross-corpus acoustic emotion recognition
AU - Zhang, Zixing
AU - Weninger, Felix
AU - Wöllmer, Martin
AU - Schuller, Björn
PY - 2011
Y1 - 2011
N2 - One of the ever-present bottlenecks in Automatic Emotion Recognition is data sparseness. We therefore investigate the suitability of unsupervised learning in cross-corpus acoustic emotion recognition through a large-scale study with six commonly used databases, including acted and natural emotion speech, and covering a variety of application scenarios and acoustic conditions. We show that adding unlabeled emotional speech to agglomerated multi-corpus training sets can enhance recognition performance even in a challenging cross-corpus setting; furthermore, we show that the expected gain by adding unlabeled data on average is approximately half the one achieved by additional manually labeled data in leave-one-corpus-out validation.
AB - One of the ever-present bottlenecks in Automatic Emotion Recognition is data sparseness. We therefore investigate the suitability of unsupervised learning in cross-corpus acoustic emotion recognition through a large-scale study with six commonly used databases, including acted and natural emotion speech, and covering a variety of application scenarios and acoustic conditions. We show that adding unlabeled emotional speech to agglomerated multi-corpus training sets can enhance recognition performance even in a challenging cross-corpus setting; furthermore, we show that the expected gain by adding unlabeled data on average is approximately half the one achieved by additional manually labeled data in leave-one-corpus-out validation.
KW - speech emotion recognition
KW - unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=84858985413&partnerID=8YFLogxK
U2 - 10.1109/ASRU.2011.6163986
DO - 10.1109/ASRU.2011.6163986
M3 - Conference contribution
AN - SCOPUS:84858985413
SN - 9781467303675
T3 - 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings
SP - 523
EP - 528
BT - 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings
T2 - 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011
Y2 - 11 December 2011 through 15 December 2011
ER -