TY - GEN
T1 - WARM-3D
T2 - 27th IEEE International Conference on Intelligent Transportation Systems, ITSC 2024
AU - Zhou, Xingcheng
AU - Fu, Deyu
AU - Zimmer, Walter
AU - Liu, Mingyu
AU - Lakshminarasimhan, Venkatnarayanan
AU - Strand, Leah
AU - Knoll, Alois C.
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Existing roadside perception systems are limited by the absence of publicly available, large-scale, high-quality 3D datasets. Exploring the use of cost-effective, extensive synthetic datasets offers a viable solution to tackle this challenge and enhance the performance of roadside monocular 3D detection. In this study, we introduce the TUMTraf Synthetic Dataset, offering a diverse and substantial collection of high-quality 3D data to augment scarce real-world datasets. Besides, we present WARM-3D, a concise yet effective framework to aid the Sim2Real domain transfer for roadside monocular 3D detection. Our method leverages cheap synthetic datasets and 2D labels from an off-the-shelf 2D detector for weak supervision. We show that WARM-3D significantly enhances performance, achieving a +12.40% increase in mAP3D over the baseline with only pseudo-2D supervision. With 2D GT as weak labels, WARM-3D even reaches performance close to the Oracle baseline. Moreover, WARM-3D improves the ability of 3D detectors to unseen sample recognition across various real-world environments, highlighting its potential for practical applications.
AB - Existing roadside perception systems are limited by the absence of publicly available, large-scale, high-quality 3D datasets. Exploring the use of cost-effective, extensive synthetic datasets offers a viable solution to tackle this challenge and enhance the performance of roadside monocular 3D detection. In this study, we introduce the TUMTraf Synthetic Dataset, offering a diverse and substantial collection of high-quality 3D data to augment scarce real-world datasets. Besides, we present WARM-3D, a concise yet effective framework to aid the Sim2Real domain transfer for roadside monocular 3D detection. Our method leverages cheap synthetic datasets and 2D labels from an off-the-shelf 2D detector for weak supervision. We show that WARM-3D significantly enhances performance, achieving a +12.40% increase in mAP3D over the baseline with only pseudo-2D supervision. With 2D GT as weak labels, WARM-3D even reaches performance close to the Oracle baseline. Moreover, WARM-3D improves the ability of 3D detectors to unseen sample recognition across various real-world environments, highlighting its potential for practical applications.
UR - http://www.scopus.com/inward/record.url?scp=105001675236&partnerID=8YFLogxK
U2 - 10.1109/ITSC58415.2024.10919929
DO - 10.1109/ITSC58415.2024.10919929
M3 - Conference contribution
AN - SCOPUS:105001675236
T3 - IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
SP - 3489
EP - 3496
BT - 2024 IEEE 27th International Conference on Intelligent Transportation Systems, ITSC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 24 September 2024 through 27 September 2024
ER -