Real-time framework for multimodal human-robot interaction

Jürgen Gast, Alexander Bannat, Tobias Rehrl, Frank Wallhoff, Gerhard Rigoll, Cornelia Wendt, Sabrina Schmidt, Michael Popp, Berthold Färber

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

This paper presents a new framework for multimodal data processing in real time. The framework comprises modules for different input and output signals and was designed for human-human and human-robot interaction scenarios. Individual modules for recording selected channels such as speech, gestures, or facial expressions can be combined with different output options (e.g., robot reactions) in a highly flexible manner. Depending on the included modules, both online and offline data processing are possible. The framework was used to analyze human-human interaction in order to gain insights into important factors and their dynamics. The recorded data comprise speech, facial expressions, gestures, and physiological signals. These naturally produced data were annotated and labeled in order to train recognition modules that will be integrated into the existing framework. The overall aim is to create a system that can recognize and react to the parameters that humans take into account during interaction. In this paper, the technical implementation and its application in a human-human and a human-robot interaction scenario are presented.
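
The paper itself is not reproduced here and the abstract publishes no code, so the following is only a minimal sketch of the kind of modular poll-and-dispatch design the abstract describes: input modules for individual channels combined flexibly with output modules, with events also logged for offline annotation. All class and method names (Framework, InputModule, poll, react) are hypothetical illustrations, not the authors' API.

```python
from __future__ import annotations

from abc import ABC, abstractmethod
from queue import Queue
from typing import Any


class InputModule(ABC):
    """Records one channel (e.g. speech or gestures) and emits events."""

    @abstractmethod
    def poll(self) -> dict[str, Any] | None:
        """Return the next recognized event, or None if nothing new."""


class OutputModule(ABC):
    """Consumes events and produces a reaction (e.g. a robot action)."""

    @abstractmethod
    def react(self, event: dict[str, Any]) -> None:
        ...


class Framework:
    """Wires any combination of input and output modules together."""

    def __init__(self) -> None:
        self.inputs: list[InputModule] = []
        self.outputs: list[OutputModule] = []
        self.log: Queue = Queue()  # events kept for offline analysis

    def add_input(self, module: InputModule) -> None:
        self.inputs.append(module)

    def add_output(self, module: OutputModule) -> None:
        self.outputs.append(module)

    def step(self) -> None:
        """One online processing cycle: poll every channel, fan out events."""
        for source in self.inputs:
            event = source.poll()
            if event is not None:
                self.log.put(event)       # record for later labeling
                for sink in self.outputs:
                    sink.react(event)     # e.g. trigger a robot reaction
```

In such a design, swapping a human-human recording setup for a human-robot one would amount to registering different output modules, which is one plausible reading of the "highly flexible" combination the abstract claims.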

Original language: English
Title of host publication: Proceedings - 2009 2nd Conference on Human System Interactions, HSI '09
Pages: 276-283
Number of pages: 8
DOIs
State: Published - 2009
Event: 2009 2nd Conference on Human System Interactions, HSI '09 - Catania, Italy
Duration: 21 May 2009 – 23 May 2009

Publication series

Name: Proceedings - 2009 2nd Conference on Human System Interactions, HSI '09

Conference

Conference: 2009 2nd Conference on Human System Interactions, HSI '09
Country/Territory: Italy
City: Catania
Period: 21/05/09 – 23/05/09
