TY - GEN
T1 - Robust multi-stream keyword and non-linguistic vocalization detection for computationally intelligent virtual agents
AU - Wöllmer, Martin
AU - Marchi, Erik
AU - Squartini, Stefano
AU - Schuller, Björn
PY - 2011
Y1 - 2011
N2 - Systems for keyword and non-linguistic vocalization detection in conversational agent applications need to be robust with respect to background noise and different speaking styles. Focussing on the Sensitive Artificial Listener (SAL) scenario which involves spontaneous, emotionally colored speech, this paper proposes a multi-stream model that applies the principle of Long Short-Term Memory to generate context-sensitive phoneme predictions which can be used for keyword detection. Further, we investigate the incorporation of noisy training material in order to create noise robust acoustic models. We show that both strategies can improve recognition performance when evaluated on spontaneous human-machine conversations as contained in the SEMAINE database.
AB - Systems for keyword and non-linguistic vocalization detection in conversational agent applications need to be robust with respect to background noise and different speaking styles. Focussing on the Sensitive Artificial Listener (SAL) scenario which involves spontaneous, emotionally colored speech, this paper proposes a multi-stream model that applies the principle of Long Short-Term Memory to generate context-sensitive phoneme predictions which can be used for keyword detection. Further, we investigate the incorporation of noisy training material in order to create noise robust acoustic models. We show that both strategies can improve recognition performance when evaluated on spontaneous human-machine conversations as contained in the SEMAINE database.
KW - Conversational agents
KW - keyword spotting
KW - long short-term memory
KW - multi-condition training
UR - http://www.scopus.com/inward/record.url?scp=79957788667&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-21090-7_58
DO - 10.1007/978-3-642-21090-7_58
M3 - Conference contribution
AN - SCOPUS:79957788667
SN - 9783642210891
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 496
EP - 505
BT - Advances in Neural Networks - 8th International Symposium on Neural Networks, ISNN 2011
T2 - 8th International Symposium on Neural Networks, ISNN 2011
Y2 - 29 May 2011 through 1 June 2011
ER -