Acoustic-linguistic recognition of interest in speech with bottleneck-BLSTM nets

Martin Wöllmer, Felix Weninger, Florian Eyben, Björn Schuller

Research output: Contribution to journalConference articlepeer-review

9 Scopus citations

Abstract

This paper proposes a novel technique for speech-based interest recognition in natural conversations. We introduce a fully automatic system that exploits the principle of bidirectional Long Short-Term Memory (BLSTM) as well as the structure of socalled bottleneck networks. BLSTM nets are able to model a self-learned amount of context information, which was shown to be beneficial for affect recognition applications, while bottleneck networks allow for efficient feature compression within neural networks. In addition to acoustic features, our technique considers linguistic information obtained from a multi-stream BLSTM-HMM speech recognizer. Evaluations on the TUM AVIC corpus reveal that the bottleneck-BLSTM method prevails over all approaches that have been proposed for the Interspeech 2010 Paralinguistic Challenge task.

Original languageEnglish
Pages (from-to)77-80
Number of pages4
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
StatePublished - 2011
Event12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011 - Florence, Italy
Duration: 27 Aug 201131 Aug 2011

Keywords

  • Affective computing
  • Interest recognition
  • Recurrent neural networks

Fingerprint

Dive into the research topics of 'Acoustic-linguistic recognition of interest in speech with bottleneck-BLSTM nets'. Together they form a unique fingerprint.

Cite this