A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams

Martin Wöllmer, Marc Al-Hames, Florian Eyben, Björn Schuller, Gerhard Rigoll

Research output: Contribution to journalArticlepeer-review

65 Scopus citations

Abstract

To overcome the computational complexity of the asynchronous hidden Markov model (AHMM), we present a novel multidimensional dynamic time warping (DTW) algorithm for hybrid fusion of asynchronous data. We show that our newly introduced multidimensional DTW concept requires significantly less decoding time while providing the same data fusion flexibility as the AHMM. Thus, it can be applied in a wide range of real-time multimodal classification tasks. Optimally exploiting mutual information during decoding even if the input streams are not synchronous, our algorithm outperforms late and early fusion techniques in a challenging bimodal speech and gesture fusion experiment.

Original languageEnglish
Pages (from-to)366-380
Number of pages15
JournalNeurocomputing
Volume73
Issue number1-3
DOIs
StatePublished - Jan 2009

Keywords

  • Asynchronous hidden Markov model
  • Dynamic time warping
  • Multimodal data fusion

Fingerprint

Dive into the research topics of 'A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams'. Together they form a unique fingerprint.

Cite this