TY - JOUR
T1 - Indoor Synthetic Data Generation
T2 - A Systematic Review
AU - Schieber, Hannah
AU - Demir, Kubilay Can
AU - Kleinbeck, Constantin
AU - Yang, Seung Hee
AU - Roth, Daniel
N1 - Publisher Copyright:
© 2024 The Authors
PY - 2024/3
Y1 - 2024/3
N2 - Objective: Deep learning-based object recognition, 6D pose estimation, and semantic scene understanding require a large amount of training data to achieve generalization. Time-consuming annotation processes, privacy, and security aspects lead to a scarcity of real-world datasets. To overcome this lack of data, synthetic data generation has been proposed, including multiple facets in the area of domain randomization to extend the data distribution. The objective of this review is to identify methods applied for synthetic data generation aiming to improve 6D pose estimation, object recognition, and semantic scene understanding in indoor scenarios. We further review methods used to extend the data distribution and discuss best practices to bridge the gap between synthetic and real-world data. Methods: We adhered to the guidelines of the systematic PRISMA technique. Three databases, IEEE Xplore, Springer Link, and ACM, and an additional manual search were conducted. In total, we identified 241 studies and included 34 in our systematic review. Conclusion: In summary, synthetic data generation has been performed using crop-out methods, graphic APIs, 3D modeling or authoring tools, or game engine-based methods. To extend the data distribution, varying scene parameters, i.e., lighting conditions or textures and the use of distracting objects in the scene are promising.
AB - Objective: Deep learning-based object recognition, 6D pose estimation, and semantic scene understanding require a large amount of training data to achieve generalization. Time-consuming annotation processes, privacy, and security aspects lead to a scarcity of real-world datasets. To overcome this lack of data, synthetic data generation has been proposed, including multiple facets in the area of domain randomization to extend the data distribution. The objective of this review is to identify methods applied for synthetic data generation aiming to improve 6D pose estimation, object recognition, and semantic scene understanding in indoor scenarios. We further review methods used to extend the data distribution and discuss best practices to bridge the gap between synthetic and real-world data. Methods: We adhered to the guidelines of the systematic PRISMA technique. Three databases, IEEE Xplore, Springer Link, and ACM, and an additional manual search were conducted. In total, we identified 241 studies and included 34 in our systematic review. Conclusion: In summary, synthetic data generation has been performed using crop-out methods, graphic APIs, 3D modeling or authoring tools, or game engine-based methods. To extend the data distribution, varying scene parameters, i.e., lighting conditions or textures and the use of distracting objects in the scene are promising.
KW - Domain randomization
KW - Indoor synthetic data
KW - Synthetic data generation
UR - http://www.scopus.com/inward/record.url?scp=85182025698&partnerID=8YFLogxK
U2 - 10.1016/j.cviu.2023.103907
DO - 10.1016/j.cviu.2023.103907
M3 - Article
AN - SCOPUS:85182025698
SN - 1077-3142
VL - 240
JO - Computer Vision and Image Understanding
JF - Computer Vision and Image Understanding
M1 - 103907
ER -