TY - GEN
T1 - Multimodal integration for meeting group action segmentation and recognition
AU - Al-Hames, Marc
AU - Dielmann, Alfred
AU - Gatica-Perez, Daniel
AU - Reiter, Stephan
AU - Renais, Steve
AU - Rigoll, Gerhard
AU - Zhang, Pong
PY - 2006
Y1 - 2006
N2 - We address the problem of segmentation and recognition of sequences of multimodal human interactions in meetings. These interactions can be seen as a rough structure of a meeting, and can be used either as input for a meeting browser or as a first step towards a higher semantic analysis of the meeting. A common lexicon of multimodal group meeting actions, a shared meeting data set, and a common evaluation procedure enable us to compare the different approaches. We compare three different multimodal feature sets and our modelling infrastructures: a higher semantic feature approach, multi-layer HMMs, a multi-stream DBN, as well as a multi-stream mixed-state DBN for disturbed data.
AB - We address the problem of segmentation and recognition of sequences of multimodal human interactions in meetings. These interactions can be seen as a rough structure of a meeting, and can be used either as input for a meeting browser or as a first step towards a higher semantic analysis of the meeting. A common lexicon of multimodal group meeting actions, a shared meeting data set, and a common evaluation procedure enable us to compare the different approaches. We compare three different multimodal feature sets and our modelling infrastructures: a higher semantic feature approach, multi-layer HMMs, a multi-stream DBN, as well as a multi-stream mixed-state DBN for disturbed data.
UR - http://www.scopus.com/inward/record.url?scp=33745574732&partnerID=8YFLogxK
U2 - 10.1007/11677482_5
DO - 10.1007/11677482_5
M3 - Conference contribution
AN - SCOPUS:33745574732
SN - 3540325492
SN - 9783540325499
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 52
EP - 63
BT - Machine Learning for Multimodal Interaction - Second International Workshop, MLMI 2005, Revised Selected Papers
T2 - 2nd International Workshop on Machine Learning for Multimodal Interaction, MLMI 2005
Y2 - 11 July 2005 through 13 July 2005
ER -