Tendencies regarding the effect of emotional intensity in inter corpus phoneme-level speech emotion modelling

Bogdan Vlasenko, Bjorn Schuller, Andreas Wendemuth

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

As emotion recognition from speech has matured to a degree where it becomes suitable for real-life applications, it is time for developing techniques for matching different types of emotional data with multi-dimensional and categories-based annotations. The categorical approach is usually applied for acted 'full blown' emotions and multi-dimensional annotation is often preferred for spontaneous real life emotions. A particularly realistic task we consider in this contribution is cross-corpus emotion recognition and its evaluation. General and phoneme-level emotional models on acted and spontaneous emotions ('very intense' and 'intense') are used in our experimental study. The emotional models were trained on spontaneous emotions from the complete VAM dataset and subsets with variable emotional intensities and evaluated on acted emotions from the Berlin EMO-DB dataset. We observe a significant classification performance gap for general models trained on very intense spontaneous emotions. As a consequence, we address the importance of collecting large corpora with very intense emotional content for training more reliable phoneme-level emotional models.

Original languageEnglish
Title of host publication2016 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings
EditorsKostas Diamantaras, Aurelio Uncini, Francesco A. N. Palmieri, Jan Larsen
PublisherIEEE Computer Society
ISBN (Electronic)9781509007462
DOIs
StatePublished - 8 Nov 2016
Externally publishedYes
Event26th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings - Vietri sul Mare, Salerno, Italy
Duration: 13 Sep 201616 Sep 2016

Publication series

NameIEEE International Workshop on Machine Learning for Signal Processing, MLSP
Volume2016-November
ISSN (Print)2161-0363
ISSN (Electronic)2161-0371

Conference

Conference26th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings
Country/TerritoryItaly
CityVietri sul Mare, Salerno
Period13/09/1616/09/16

Keywords

  • cross-corpus evaluation
  • emotion recognition
  • emotional intensity
  • phoneme-level emotional models
  • turn-level emotional models

Fingerprint

Dive into the research topics of 'Tendencies regarding the effect of emotional intensity in inter corpus phoneme-level speech emotion modelling'. Together they form a unique fingerprint.

Cite this