TY - GEN
T1 - Switching linear dynamic models for noise robust in-car speech recognition
AU - Schuller, Björn
AU - Wöllmer, Martin
AU - Moosmayr, Tobias
AU - Ruske, Günther
AU - Rigoll, Gerhard
PY - 2008
Y1 - 2008
N2 - Performance of speech recognition systems strongly degrades in the presence of background noise, like the driving noise in the interior of a car. We compare two different Kalman filtering approaches which attempt to improve noise robustness: Switching Linear Dynamic Models (SLDM) and Autoregressive Switching Linear Dynamical Systems (AR-SLDS). Unlike previous works which are restricted on considering white noise, we evaluate the modeling concepts in a noisy speech recognition task where also colored noise produced through different driving conditions and car types is taken into account. Thereby we demonstrate that speech enhancement based on Kalman filtering prevails over all standard de-noising techniques considered herein, such as Wiener filtering, Histogram Equalization, and Unsupervised Spectral Subtraction.
AB - Performance of speech recognition systems strongly degrades in the presence of background noise, like the driving noise in the interior of a car. We compare two different Kalman filtering approaches which attempt to improve noise robustness: Switching Linear Dynamic Models (SLDM) and Autoregressive Switching Linear Dynamical Systems (AR-SLDS). Unlike previous works which are restricted on considering white noise, we evaluate the modeling concepts in a noisy speech recognition task where also colored noise produced through different driving conditions and car types is taken into account. Thereby we demonstrate that speech enhancement based on Kalman filtering prevails over all standard de-noising techniques considered herein, such as Wiener filtering, Histogram Equalization, and Unsupervised Spectral Subtraction.
UR - http://www.scopus.com/inward/record.url?scp=54349121839&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-69321-5_25
DO - 10.1007/978-3-540-69321-5_25
M3 - Conference contribution
AN - SCOPUS:54349121839
SN - 3540693203
SN - 9783540693208
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 244
EP - 253
BT - Pattern Recognition - 30th DAGM Symposium, Proceedings
T2 - 30th DAGM Symposium on Pattern Recognition
Y2 - 10 June 2008 through 13 June 2008
ER -