Three dimensional object tracking based on audiovisual fusion using particle swarm optimization

Fakheredine Keyrouz, Ulrich Kirchmaier, Klaus Diepold

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

In this paper a new method of detecting and tracking a human person in three dimensional space using audio and video data is proposed. A simple tracking system with two microphones and stereo vision is introduced. The audio information is resulting from the Generalized Cross Correlation (GCC) algorithm, and the video information is extracted by the Continuously Adaptive Mean shift (CAMshift) method. The localization estimates delivered by these two systems are then combined using a novel Particle Swarm Optimization (PSO) fusion technique. In our approach the particles move in the 3D space and iteratively evaluate their current position with regard to the localization estimates of the audio and video module. This facilitates the direct determination of the object's three dimensional position. Compared to existing methods, this novel technique achieves faster tracking performance while being independent of any kind of model, statistic, or assumption. audiovisual fusion, generalized cross correlation, object tracking, particle swarm optimization, stereo vision.

Original languageEnglish
Title of host publicationProceedings of the 11th International Conference on Information Fusion, FUSION 2008
DOIs
StatePublished - 2008
Event11th International Conference on Information Fusion, FUSION 2008 - Cologne, Germany
Duration: 30 Jun 20083 Jul 2008

Publication series

NameProceedings of the 11th International Conference on Information Fusion, FUSION 2008

Conference

Conference11th International Conference on Information Fusion, FUSION 2008
Country/TerritoryGermany
CityCologne
Period30/06/083/07/08

Fingerprint

Dive into the research topics of 'Three dimensional object tracking based on audiovisual fusion using particle swarm optimization'. Together they form a unique fingerprint.

Cite this