Unsupervised learning in cross-corpus acoustic emotion recognition

Zixing Zhang, Felix Weninger, Martin Wöllmer, Björn Schuller

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

102 Scopus citations

Abstract

One of the ever-present bottlenecks in Automatic Emotion Recognition is data sparseness. We therefore investigate the suitability of unsupervised learning in cross-corpus acoustic emotion recognition through a large-scale study with six commonly used databases, including acted and natural emotion speech, and covering a variety of application scenarios and acoustic conditions. We show that adding unlabeled emotional speech to agglomerated multi-corpus training sets can enhance recognition performance even in a challenging cross-corpus setting; furthermore, we show that the expected gain by adding unlabeled data on average is approximately half the one achieved by additional manually labeled data in leave-one-corpus-out validation.

Original languageEnglish
Title of host publication2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings
Pages523-528
Number of pages6
DOIs
StatePublished - 2011
Event2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011 - Waikoloa, HI, United States
Duration: 11 Dec 201115 Dec 2011

Publication series

Name2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings

Conference

Conference2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011
Country/TerritoryUnited States
CityWaikoloa, HI
Period11/12/1115/12/11

Keywords

  • speech emotion recognition
  • unsupervised learning

Fingerprint

Dive into the research topics of 'Unsupervised learning in cross-corpus acoustic emotion recognition'. Together they form a unique fingerprint.

Cite this