TY - GEN
T1 - A multi-modal graphical model for robust recognition of group actions in meetings from disturbed videos
AU - Al-Hames, Marc
AU - Rigoll, Gerhard
PY - 2005
Y1 - 2005
N2 - In this work we present a novel multi-modal mixed-state dynamic Bayesian network (DBN) for robust meeting event classification from disturbed videos. The model uses information from the audio and the visual channel to structure meetings into segments. Within the DBN a multi-stream hidden Markov model (HMM) is coupled with a linear dynamical system (LDS) to compensate disturbances in the visual channel. Thereby the HMM is used as driving input for the LDS. Thus the model can handle noise and occlusions in the video. Experimental results on real meeting data show that the new model is highly preferable to all single-stream approaches. Compared to a baseline multi-modal early fusion HMM, the new DBN is 3.5%, respectively up to 6.1% better for clear and visual disturbed data, this corresponds to a relative error reduction of 23.6%, respectively 29.9%.
AB - In this work we present a novel multi-modal mixed-state dynamic Bayesian network (DBN) for robust meeting event classification from disturbed videos. The model uses information from the audio and the visual channel to structure meetings into segments. Within the DBN a multi-stream hidden Markov model (HMM) is coupled with a linear dynamical system (LDS) to compensate disturbances in the visual channel. Thereby the HMM is used as driving input for the LDS. Thus the model can handle noise and occlusions in the video. Experimental results on real meeting data show that the new model is highly preferable to all single-stream approaches. Compared to a baseline multi-modal early fusion HMM, the new DBN is 3.5%, respectively up to 6.1% better for clear and visual disturbed data, this corresponds to a relative error reduction of 23.6%, respectively 29.9%.
UR - http://www.scopus.com/inward/record.url?scp=33749247613&partnerID=8YFLogxK
U2 - 10.1109/ICIP.2005.1530418
DO - 10.1109/ICIP.2005.1530418
M3 - Conference contribution
AN - SCOPUS:33749247613
SN - 0780391349
SN - 9780780391345
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 421
EP - 424
BT - IEEE International Conference on Image Processing 2005, ICIP 2005
T2 - IEEE International Conference on Image Processing 2005, ICIP 2005
Y2 - 11 September 2005 through 14 September 2005
ER -