TY - JOUR
T1 - Investigation of Deep Learning Datasets for Warehousing Logistics
AU - Holm, Dimitrij Marian
AU - Junge, Philipp
AU - Rutinowski, Jérôme
AU - Fottner, Johannes
N1 - Publisher Copyright:
© 2023, Publish-Ing in cooperation with TIB - Leibniz Information Centre for Science and Technology University Library. All rights reserved.
PY - 2023
Y1 - 2023
N2 - Deep Learning for Computer Vision holds great potential in warehousing logistics, for example for applications such as mobile robots or autonomous forklifts. However, the availability of labelled image datasets within this area is limited. To address this problem, we benchmarked two different datasets, LOCO (Logistics Objects in Context) and TOMIE (Tracking Of Multiple Industrial Entities), to find out, if these datasets can be used interchangeably. Therefore, we examine the usability of these datasets for Object Detection tasks using the YOLOv7 framework. For this we trained several networks and compared them with each other. A deep analysis between these two datasets shows that they are quite different and only suitable for specific tasks which are not interchangeable, despite having emerged from the same research domain. More thorough investigations are performed to find the reasons for this lack of compatibility. To close the gap between LOCO and TOMIE, a synthetic data generation pipeline for pallets is developed and 18,000 synthetic pallet images are rendered. Furthermore, models are trained based on the synthetic data and compared with the models trained on real data. The synthetic data generation pipeline successfully closes the reality gap, and the performance on TOMIE is increased, but the performance on LOCO remains significantly weaker, in comparison. To develop a deeper understanding of this behaviour we examine the underlying datasets and the reasons for the performance difference are identified.
AB - Deep Learning for Computer Vision holds great potential in warehousing logistics, for example for applications such as mobile robots or autonomous forklifts. However, the availability of labelled image datasets within this area is limited. To address this problem, we benchmarked two different datasets, LOCO (Logistics Objects in Context) and TOMIE (Tracking Of Multiple Industrial Entities), to find out, if these datasets can be used interchangeably. Therefore, we examine the usability of these datasets for Object Detection tasks using the YOLOv7 framework. For this we trained several networks and compared them with each other. A deep analysis between these two datasets shows that they are quite different and only suitable for specific tasks which are not interchangeable, despite having emerged from the same research domain. More thorough investigations are performed to find the reasons for this lack of compatibility. To close the gap between LOCO and TOMIE, a synthetic data generation pipeline for pallets is developed and 18,000 synthetic pallet images are rendered. Furthermore, models are trained based on the synthetic data and compared with the models trained on real data. The synthetic data generation pipeline successfully closes the reality gap, and the performance on TOMIE is increased, but the performance on LOCO remains significantly weaker, in comparison. To develop a deeper understanding of this behaviour we examine the underlying datasets and the reasons for the performance difference are identified.
KW - Datasets Generation
KW - Deep Learning
KW - Object Detection
KW - Warehousing Logistics
UR - http://www.scopus.com/inward/record.url?scp=85187991234&partnerID=8YFLogxK
U2 - 10.15488/15311
DO - 10.15488/15311
M3 - Conference article
AN - SCOPUS:85187991234
SN - 2701-6277
SP - 119
EP - 128
JO - Proceedings of the Conference on Production Systems and Logistics
JF - Proceedings of the Conference on Production Systems and Logistics
T2 - 5th Conference on Production Systems and Logistics, CPSL 2023
Y2 - 14 November 2023 through 17 November 2023
ER -