Abstract
This paper proposes a method for 3D whole-body motion recovery and motion recognition from a sequence of occluded monocular camera images based on statistical inference using a motion database. In the motion database, each motion primitive (e.g., walk, kick, etc.) is represented in an abstract statistical form. Instead of extracting rich information by expensive computation of image processing, we propose an inference mechanism from low level image features (e.g., optical flow), inspired by psychological research on how humans perceive motion. The proposed inference mechanism recovers the 3D body configuration and finds the closest motion primitive in the motion database. Observations in 2D camera image space can be recognized even though the motion database is prepared in a different space (such as joint space) by coordinate transformation of the statistical motion representation. The approach is view invariant since the demonstrator's baselink position and orientation with respect to camera coordinates are tracked using an extended particle filter. Finally, an experimental evaluation of the presented concepts using a 56-degree-of-freedom articulated human model is discussed.
Original language | English |
---|---|
Pages (from-to) | 818-832 |
Number of pages | 15 |
Journal | Robotics and Autonomous Systems |
Volume | 62 |
Issue number | 6 |
DOIs | |
State | Published - Jun 2014 |
Externally published | Yes |
Keywords
- Monocular vision
- Motion capturing
- Motion recognition
- Motion recovery
- Optical flow
- Particle filter
- Statistical inference