TY - GEN
T1 - HOIsim
T2 - 30th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2021
AU - Zakour, Marsil
AU - Mellouli, Alaeddine
AU - Chaudhari, Rahul
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/8/8
Y1 - 2021/8/8
N2 - Correct understanding of human activities is critical for meaningful assistance by robots in daily life. The development of perception algorithms and Deep Learning models of human activity requires large-scale sensor datasets. Good real-world activity data is, however, difficult and time- consuming to acquire. Several precisely calibrated and time- synchronized sensors are required, and the annotation and labeling of the collected sensor data is extremely labor intensive.To address these challenges, we present a 3D activity simulator, "HOIsim", focusing on Human-Object Interactions (HOIs). Using HOIsim, we provide a procedurally generated synthetic dataset of two sample daily life activities "lunch"and "breakfast". The dataset contains out-of-the-box ground truth annotations in the form of human and object poses, as well as ground truth activity labels. Furthermore, we introduce methods to meaningfully randomize activity flows and the environment topology. This allows us to generate a large number of random variants of these activities in very less time.Based on an abstraction of the low-level pose data in the form of spatiotemporal graphs of HOIs, we evaluate the generated Lunch dataset only with two Deep Learning models for activity recognition. The first model, based on recurrent neural networks achieves an accuracy of 87%, whereas the other, based on transformers, achieves an accuracy of 94.7%.
AB - Correct understanding of human activities is critical for meaningful assistance by robots in daily life. The development of perception algorithms and Deep Learning models of human activity requires large-scale sensor datasets. Good real-world activity data is, however, difficult and time- consuming to acquire. Several precisely calibrated and time- synchronized sensors are required, and the annotation and labeling of the collected sensor data is extremely labor intensive.To address these challenges, we present a 3D activity simulator, "HOIsim", focusing on Human-Object Interactions (HOIs). Using HOIsim, we provide a procedurally generated synthetic dataset of two sample daily life activities "lunch"and "breakfast". The dataset contains out-of-the-box ground truth annotations in the form of human and object poses, as well as ground truth activity labels. Furthermore, we introduce methods to meaningfully randomize activity flows and the environment topology. This allows us to generate a large number of random variants of these activities in very less time.Based on an abstraction of the low-level pose data in the form of spatiotemporal graphs of HOIs, we evaluate the generated Lunch dataset only with two Deep Learning models for activity recognition. The first model, based on recurrent neural networks achieves an accuracy of 87%, whereas the other, based on transformers, achieves an accuracy of 94.7%.
UR - http://www.scopus.com/inward/record.url?scp=85115084718&partnerID=8YFLogxK
U2 - 10.1109/RO-MAN50785.2021.9515349
DO - 10.1109/RO-MAN50785.2021.9515349
M3 - Conference contribution
AN - SCOPUS:85115084718
T3 - 2021 30th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2021
SP - 1124
EP - 1131
BT - 2021 30th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 8 August 2021 through 12 August 2021
ER -