Skip to main navigation Skip to search Skip to main content

Active learning by label uncertainty for acoustic emotion recognition

  • Technical University of Munich
  • Universität Passau

Research output: Contribution to journalConference articlepeer-review

16 Scopus citations

Abstract

Speech data is in principle available in large amounts for the training of acoustic emotion recognisers. However, emotional labelling is usually not given and the distribution is heavily unbalanced, as most data is 'rather neutral' than truly 'emotional'. In the 'hay stack' of speech data, Active Learning automatically identifies the 'needles', i.e., the more informative instances to reduce human labelling effort when building a classifier, e.g., for acoustic emotion recognition. The critical issue thus is the determination and quantification of informativeness. To this end, we suggest to exploit the reliability of the usual ambiguity of emotional labels, i.e., we propose a novel approach based on label uncertainty. By building a certainty model and predicting the candidate instances, informativeness is thus based on labeller agreement. In addition, we consider class sparseness. The results of extensive test runs under well standardised conditions show the method's great potential in reducing labelling costs while boosting performance.

Original languageEnglish
Pages (from-to)2856-2860
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
StatePublished - 2013
Event14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 - Lyon, France
Duration: 25 Aug 201329 Aug 2013

Keywords

  • Active learning
  • Class sparseness
  • Confidence values
  • Label uncertainty

Fingerprint

Dive into the research topics of 'Active learning by label uncertainty for acoustic emotion recognition'. Together they form a unique fingerprint.

Cite this