DEMoS: an Italian emotional speech corpus: Elicitation methods, machine learning, and perception

Emilia Parada-Cabaleiro, Giovanni Costantini, Anton Batliner, Maximilian Schmitt, Björn W. Schuller

Publikation: Beitrag in FachzeitschriftArtikelBegutachtung

41 Zitate (Scopus)

Abstract

We present DEMoS (Database of Elicited Mood in Speech), a new, large database with Italian emotional speech: 68 speakers, some 9 k speech samples.As Italian is under-represented in speech emotion research, for a comparison with the state-of-the-art, we model the ‘big 6 emotions’ and guilt. Besides making available this database for research, our contribution is three-fold: First, we employ a variety of mood induction procedures, whose combinations are especially tailored for specific emotions.Second, we use combinations of selection procedures such as an alexithymia test and self- and external assessment, obtaining 1,5 k (proto-) typical samples; these were used in a perception test (86 native Italian subjects, categorical identification and dimensional rating). Third, machine learning techniques—based on standardised brute-forced openSMILE ComParE features and support vector machine classifiers—were applied to assess how emotional typicality and sample size might impact machine learning efficiency.Our results are three-fold as well: First, we show that appropriate induction techniques ensure the collection of valid samples, whereas the type of self-assessment employed turned out not to be a meaningful measurement. Second, emotional typicality—which shows up in an acoustic analysis of prosodic main features—in contrast to sample size is not an essential feature for successfully training machine learning models. Third, the perceptual findings demonstrate that the confusion patterns mostly relate to cultural rules and to ambiguous emotions.

OriginalspracheEnglisch
Seiten (von - bis)341-383
Seitenumfang43
FachzeitschriftLanguage Resources and Evaluation
Jahrgang54
Ausgabenummer2
DOIs
PublikationsstatusVeröffentlicht - 1 Juni 2020
Extern publiziertJa

Fingerprint

Untersuchen Sie die Forschungsthemen von „DEMoS: an Italian emotional speech corpus: Elicitation methods, machine learning, and perception“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren