TY - GEN
T1 - Robust vocabulary independent keyword spotting with graphical models
AU - Wöllmer, Martin
AU - Eyben, Florian
AU - Schuller, Björn
AU - Rigoll, Gerhard
PY - 2009
Y1 - 2009
N2 - This paper introduces a novel graphical model architecture for robust and vocabulary independent keyword spotting which does not require the training of an explicit garbage model. We show how a graphical model structure for phoneme recognition can be extended to a keyword spotter that is robust with respect to phoneme recognition errors. We use a hidden garbage variable together with the concept of switching parents to model keywords as well as arbitrary speech. This implies that keywords can be added to the vocabulary without having to re-train the model. Thereby the design of our model architecture is optimised to reliably detect keywords rather than to decode keyword phoneme sequences as arbitrary speech, while offering a parameter to adjust the operating point on the Receiver Operating Characteristics curve. Experiments on the TIMIT corpus reveal that our graphical model outperforms a comparable Hidden Markov Model based keyword spotter that uses conventional garbage modelling.
AB - This paper introduces a novel graphical model architecture for robust and vocabulary independent keyword spotting which does not require the training of an explicit garbage model. We show how a graphical model structure for phoneme recognition can be extended to a keyword spotter that is robust with respect to phoneme recognition errors. We use a hidden garbage variable together with the concept of switching parents to model keywords as well as arbitrary speech. This implies that keywords can be added to the vocabulary without having to re-train the model. Thereby the design of our model architecture is optimised to reliably detect keywords rather than to decode keyword phoneme sequences as arbitrary speech, while offering a parameter to adjust the operating point on the Receiver Operating Characteristics curve. Experiments on the TIMIT corpus reveal that our graphical model outperforms a comparable Hidden Markov Model based keyword spotter that uses conventional garbage modelling.
UR - http://www.scopus.com/inward/record.url?scp=77949350062&partnerID=8YFLogxK
U2 - 10.1109/ASRU.2009.5373544
DO - 10.1109/ASRU.2009.5373544
M3 - Conference contribution
AN - SCOPUS:77949350062
SN - 9781424454792
T3 - Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009
SP - 349
EP - 353
BT - Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009
T2 - 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009
Y2 - 13 December 2009 through 17 December 2009
ER -