TY - GEN
T1 - Combining Bottleneck-BLSTM and semi-supervised sparse NMF for recognition of conversational speech in highly instationary noise
AU - Weninger, Felix
AU - Wöllmer, Martin
AU - Schuller, Björn
PY - 2012
Y1 - 2012
N2 - We address the speaker-independent automatic recognition of spontaneous speech in highly variable noise by applying semi-supervised sparse non-negative matrix factorization (NMF) for speech enhancement coupled with our recently proposed front-end utilizing bottleneck (BN) features generated by a bidirectional Long Short-Term Memory (BLSTM) recurrent neural network. In our evaluation, we unite the noise corpus and evaluation protocol of the 2011 PASCAL CHiME challenge with the Buckeye database, and we demonstrate that the combination of NMF enhancement and BN-BLSTM front-end introduces significant and consistent gains in word accuracy in this highly challenging task at signal-to-noise ratios from -6 to 9 dB.
UR - http://www.scopus.com/inward/record.url?scp=84878390904&partnerID=8YFLogxK
U2 - 10.21437/interspeech.2012-108
DO - 10.21437/interspeech.2012-108
M3 - Conference contribution
AN - SCOPUS:84878390904
SN - 9781622767595
T3 - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
SP - 302
EP - 305
BT - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
PB - International Speech Communication Association
T2 - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Y2 - 9 September 2012 through 13 September 2012
ER -