TY - GEN
T1 - A multi-modal mixed-state dynamic Bayesian network for robust meeting event recognition from disturbed data
AU - Al-Hames, Marc
AU - Rigoll, Gerhard
PY - 2005
Y1 - 2005
N2 - In this work we present a novel multi-modal mixed-state dynamic Bayesian network (DBN) for robust meeting event classification. The model uses information from lapel microphones, a microphone array and visual information to structure meetings into segments. Within the DBN a multistream hidden Markov model (HMM) is coupled with a linear dynamical system (LDS) to compensate disturbances in the data. Thereby the HMM is used as driving input for the LDS. The model can handle noise and occlusions in all channels. Experimental results on real meeting data show that the new model is highly preferable to all single-stream approaches. Compared to a baseline multi-modal early fusion HMM, the new DBN is more than 2.5%, respectively 1.5% better for clear and disturbed data, this corresponds to a relative error reduction of 17%, respectively 9%.
AB - In this work we present a novel multi-modal mixed-state dynamic Bayesian network (DBN) for robust meeting event classification. The model uses information from lapel microphones, a microphone array and visual information to structure meetings into segments. Within the DBN a multistream hidden Markov model (HMM) is coupled with a linear dynamical system (LDS) to compensate disturbances in the data. Thereby the HMM is used as driving input for the LDS. The model can handle noise and occlusions in all channels. Experimental results on real meeting data show that the new model is highly preferable to all single-stream approaches. Compared to a baseline multi-modal early fusion HMM, the new DBN is more than 2.5%, respectively 1.5% better for clear and disturbed data, this corresponds to a relative error reduction of 17%, respectively 9%.
UR - http://www.scopus.com/inward/record.url?scp=33750540562&partnerID=8YFLogxK
U2 - 10.1109/ICME.2005.1521356
DO - 10.1109/ICME.2005.1521356
M3 - Conference contribution
AN - SCOPUS:33750540562
SN - 0780393325
SN - 9780780393325
T3 - IEEE International Conference on Multimedia and Expo, ICME 2005
SP - 45
EP - 48
BT - IEEE International Conference on Multimedia and Expo, ICME 2005
T2 - IEEE International Conference on Multimedia and Expo, ICME 2005
Y2 - 6 July 2005 through 8 July 2005
ER -