Temporal and situational context modeling for improved dominance recognition in meetings

Martin Wöllmer, Florian Eyben, Björn Schuller, Gerhard Rigoll

Publication: Contribution to book/report/conference proceedings › Conference contribution › Peer-reviewed

4 citations (Scopus)

Abstract

We present and evaluate a novel approach towards automatically detecting a speaker's level of dominance in a meeting scenario. Since previous studies reveal that audio appears to be the most important modality for dominance recognition, we focus on the analysis of the speech signals recorded in multiparty meetings. Unlike recently published techniques which concentrate on frame-level hidden Markov modeling, we propose a recognition framework operating on segmental data and investigate context modeling on three different levels to explore possible performance gains. First, we apply a set of statistical functionals to capture large-scale feature-level context within a speech segment. Second, we consider bidirectional Long Short-Term Memory recurrent neural networks for long-range temporal context modeling between segments. Finally, we evaluate the benefit of situational context incorporation by simultaneously modeling speech of all meeting participants. Overall, our approach leads to a remarkable increase of recognition accuracy when compared to hidden Markov modeling.
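
The first and third context levels in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the functionals or the frame-level features, so the four functionals used here (mean, standard deviation, minimum, maximum) and the helper names `segment_functionals` and `joint_vector` are assumptions chosen for illustration. The bidirectional LSTM stage is omitted.

```python
from statistics import mean, pstdev


def segment_functionals(frames):
    """Collapse a variable-length segment of frame-level feature vectors
    into one fixed-length vector by applying functionals per feature.

    The choice of functionals is an assumption for illustration only.
    """
    if not frames:
        raise ValueError("segment has no frames")
    width = len(frames[0])
    if any(len(frame) != width for frame in frames):
        raise ValueError("all frames must have the same number of features")

    vector = []
    # zip(*frames) yields one tuple per low-level feature across all frames.
    for column in zip(*frames):
        vector.extend([mean(column), pstdev(column), min(column), max(column)])
    return vector


def joint_vector(segments_by_speaker):
    """Hypothetical situational-context input: concatenate the functionals
    of all participants in a fixed (sorted) speaker order."""
    joint = []
    for speaker in sorted(segments_by_speaker):
        joint.extend(segment_functionals(segments_by_speaker[speaker]))
    return joint


if __name__ == "__main__":
    # Two frames with two features each give 2 features x 4 functionals.
    print(segment_functionals([[1.0, 10.0], [3.0, 30.0]]))
    print(joint_vector({"A": [[0.0], [4.0]], "B": [[2.0]]}))
```

Because every segment maps to a vector of the same length regardless of its duration, a sequence of such vectors could serve as input to a segment-level sequence model such as the recurrent network the abstract describes.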

Original language: English
Title: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Pages: 350-353
Number of pages: 4
Publication status: Published - 2012
Event: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 - Portland, OR, United States
Duration: 9 Sept. 2012 to 13 Sept. 2012

Publication series

Name: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Volume: 1

Conference

Conference: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Country/Territory: United States
City: Portland, OR
Period: 9/09/12 to 13/09/12
