DEMoS: an Italian emotional speech corpus: Elicitation methods, machine learning, and perception

Emilia Parada-Cabaleiro, Giovanni Costantini, Anton Batliner, Maximilian Schmitt, Björn W. Schuller

Research output: Contribution to journal › Article › peer-review

40 Scopus citations

Abstract

We present DEMoS (Database of Elicited Mood in Speech), a new, large database of Italian emotional speech: 68 speakers, some 9 k speech samples. As Italian is under-represented in speech emotion research, for a comparison with the state-of-the-art, we model the ‘big 6 emotions’ and guilt. Besides making this database available for research, our contribution is three-fold: First, we employ a variety of mood induction procedures, whose combinations are especially tailored for specific emotions. Second, we use combinations of selection procedures such as an alexithymia test and self- and external assessment, obtaining 1.5 k (proto-) typical samples; these were used in a perception test (86 native Italian subjects, categorical identification and dimensional rating). Third, machine learning techniques (based on standardised brute-forced openSMILE ComParE features and support vector machine classifiers) were applied to assess how emotional typicality and sample size might impact machine learning efficiency. Our results are three-fold as well: First, we show that appropriate induction techniques ensure the collection of valid samples, whereas the type of self-assessment employed turned out not to be a meaningful measurement. Second, emotional typicality (which shows up in an acoustic analysis of the main prosodic features), in contrast to sample size, is not an essential feature for successfully training machine learning models. Third, the perceptual findings demonstrate that the confusion patterns mostly relate to cultural rules and to ambiguous emotions.

Original language: English
Pages (from-to): 341-383
Number of pages: 43
Journal: Language Resources and Evaluation
Volume: 54
Issue number: 2
State: Published - 1 Jun 2020
Externally published: Yes

Keywords

  • Elicitation
  • Emotional speech
  • Italian corpus
  • Machine learning
  • Mood induction procedures
  • Prototype
