TY - JOUR
T1 - Learning Occupancy Priors of Human Motion from Semantic Maps of Urban Environments
AU - Rudenko, Andrey
AU - Palmieri, Luigi
AU - Doellinger, Johannes
AU - Lilienthal, Achim J.
AU - Arras, Kai O.
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2021/4
Y1 - 2021/4
N2 - Understanding and anticipating human activity is an important capability for intelligent systems in mobile robotics, autonomous driving, and video surveillance. While learning from demonstrations with on-site collected trajectory data is a powerful approach to discover recurrent motion patterns, generalization to new environments, where sufficient motion data are not readily available, remains a challenge. In many cases, however, semantic information about the environment is a highly informative cue for the prediction of pedestrian motion or the estimation of collision risks. In this work, we infer occupancy priors of human motion using only semantic environment information as input. To this end, we apply and discuss a traditional Inverse Optimal Control approach, and propose a novel approach based on Convolutional Neural Networks (CNN) to predict future occupancy maps. Our CNN method produces flexible context-Aware occupancy estimations for semantically uniform map regions and generalizes well already with small amounts of training data. Evaluated on synthetic and real-world data, it shows superior results compared to several baselines, marking a qualitative step-up in semantic environment assessment.
AB - Understanding and anticipating human activity is an important capability for intelligent systems in mobile robotics, autonomous driving, and video surveillance. While learning from demonstrations with on-site collected trajectory data is a powerful approach to discover recurrent motion patterns, generalization to new environments, where sufficient motion data are not readily available, remains a challenge. In many cases, however, semantic information about the environment is a highly informative cue for the prediction of pedestrian motion or the estimation of collision risks. In this work, we infer occupancy priors of human motion using only semantic environment information as input. To this end, we apply and discuss a traditional Inverse Optimal Control approach, and propose a novel approach based on Convolutional Neural Networks (CNN) to predict future occupancy maps. Our CNN method produces flexible context-Aware occupancy estimations for semantically uniform map regions and generalizes well already with small amounts of training data. Evaluated on synthetic and real-world data, it shows superior results compared to several baselines, marking a qualitative step-up in semantic environment assessment.
KW - Deep learning for visual perception
KW - human detection and tracking
KW - human motion analysis
KW - human motion prediction
KW - semantic scene understanding
UR - http://www.scopus.com/inward/record.url?scp=85101765394&partnerID=8YFLogxK
U2 - 10.1109/LRA.2021.3062010
DO - 10.1109/LRA.2021.3062010
M3 - Article
AN - SCOPUS:85101765394
SN - 2377-3766
VL - 6
SP - 3248
EP - 3255
JO - IEEE Robotics and Automation Letters
JF - IEEE Robotics and Automation Letters
IS - 2
M1 - 9362163
ER -