Convolutive non-negative sparse coding and new features for speech overlap handling in speaker diarization

Jürgen T. Geiger, Ravichander Vipperla, Simon Bozonnet, Nicholas Evans, Björn Schuller, Gerhard Rigoll

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

14 Zitate (Scopus)

Abstract

The effective handling of overlapping speech is at the limits of the current state of the art in speaker diarization. This paper presents our latest work in overlap detection. We report the combination of features derived through convolutive nonnegative sparse coding and new energy, spectral and voicingrelated features within a conventional HMM system. Overlap detection results are fully integrated into our top-down diarization system through the application of overlap exclusion and overlap labeling. Experiments on a subset of the AMI corpus show that the new system delivers significant reductions in missed speech and speaker error. Through overlap exclusion and labelling the overall diarization error rate is shown to improve by 6.4 % relative.

OriginalspracheEnglisch
Titel13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Seiten2151-2154
Seitenumfang4
PublikationsstatusVeröffentlicht - 2012
Veranstaltung13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 - Portland, OR, USA/Vereinigte Staaten
Dauer: 9 Sept. 201213 Sept. 2012

Publikationsreihe

Name13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Band3

Konferenz

Konferenz13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Land/GebietUSA/Vereinigte Staaten
OrtPortland, OR
Zeitraum9/09/1213/09/12

Fingerprint

Untersuchen Sie die Forschungsthemen von „Convolutive non-negative sparse coding and new features for speech overlap handling in speaker diarization“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren