Switching linear dynamic models for noise robust in-car speech recognition

Björn Schuller, Martin Wöllmer, Tobias Moosmayr, Günther Ruske, Gerhard Rigoll

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Performance of speech recognition systems strongly degrades in the presence of background noise, like the driving noise in the interior of a car. We compare two different Kalman filtering approaches which attempt to improve noise robustness: Switching Linear Dynamic Models (SLDM) and Autoregressive Switching Linear Dynamical Systems (AR-SLDS). Unlike previous works which are restricted on considering white noise, we evaluate the modeling concepts in a noisy speech recognition task where also colored noise produced through different driving conditions and car types is taken into account. Thereby we demonstrate that speech enhancement based on Kalman filtering prevails over all standard de-noising techniques considered herein, such as Wiener filtering, Histogram Equalization, and Unsupervised Spectral Subtraction.

Original languageEnglish
Title of host publicationPattern Recognition - 30th DAGM Symposium, Proceedings
Pages244-253
Number of pages10
DOIs
StatePublished - 2008
Event30th DAGM Symposium on Pattern Recognition - Munich, Germany
Duration: 10 Jun 200813 Jun 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5096 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference30th DAGM Symposium on Pattern Recognition
Country/TerritoryGermany
CityMunich
Period10/06/0813/06/08

Fingerprint

Dive into the research topics of 'Switching linear dynamic models for noise robust in-car speech recognition'. Together they form a unique fingerprint.

Cite this