Sparse, hierarchical and semi-supervised base learning for monaural enhancement of conversational speech

Felix Weninger, Martin Wöllmer, Björn Schuller

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We address the learning of noise bases in a monaural speaker-independent speech enhancement framework based on non-negative matrix factorization. Bases are estimated from training data in batch processing by means of hierarchical and non-hierarchical sparse coding, or determined during the speech enhancement process based on the divergence of the observed noisy speech signal and the speech base. In extensive test runs on the Buckeye corpus of highly spontaneous speech and the CHiME corpus of non-stationary real-life noise, we observe that semi-supervised learning of noise bases leads to overall best results while a-priori learning of noise bases is useful to speed up computation.

Original languageEnglish
Title of host publicationSprachkommunikation - 10. ITG-Fachtagung
PublisherVDE VERLAG GMBH
Pages11-14
Number of pages4
ISBN (Electronic)9783800734559
StatePublished - 2020
Event10. ITG-Fachtagung Sprachkommunikation - 10th ITG Conference on Speech Communication - Braunschweig, Germany
Duration: 26 Sep 201228 Sep 2012

Publication series

NameSprachkommunikation - 10. ITG-Fachtagung

Conference

Conference10. ITG-Fachtagung Sprachkommunikation - 10th ITG Conference on Speech Communication
Country/TerritoryGermany
CityBraunschweig
Period26/09/1228/09/12

Fingerprint

Dive into the research topics of 'Sparse, hierarchical and semi-supervised base learning for monaural enhancement of conversational speech'. Together they form a unique fingerprint.

Cite this