TY - GEN
T1 - GraphRelate3D
T2 - 27th IEEE International Conference on Intelligent Transportation Systems, ITSC 2024
AU - Liu, Mingyu
AU - Yurtsever, Ekim
AU - Brede, Marc
AU - Meng, Jun
AU - Zimmer, Walter
AU - Zhou, Xingcheng
AU - Zagar, Bare Luka
AU - Cui, Yuning
AU - Knoll, Alois C.
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Accurate and effective 3D object detection is critical for ensuring the driving safety of autonomous vehicles. Recently, state-of-the-art two-stage 3D object detectors have exhibited promising performance. However, these methods refine proposals individually, ignoring the rich contextual information in the object relationships between the neighbor proposals. In this study, we introduce an object relation module, consisting of a graph generator and a graph neural network (GNN), to learn the spatial information from certain patterns to improve 3D object detection. Specifically, we create an inter-object relationship graph based on proposals in a frame via the graph generator to connect each proposal with its neighbor proposals. Afterward, the GNN module extracts edge features from the generated graph and iteratively refines proposal features with the captured edge features. Ultimately, we leverage the refined features as input to the detection head to obtain detection results. Our approach improves upon the baseline PV-RCNN on the KITTI validation set for the car class across easy, moderate, and hard difficulty levels by 0.82%, 0.74%, and 0.58%, respectively. Additionally, our method outperforms the baseline by more than 1% under the moderate and hard levels BEV AP on the test server.
AB - Accurate and effective 3D object detection is critical for ensuring the driving safety of autonomous vehicles. Recently, state-of-the-art two-stage 3D object detectors have exhibited promising performance. However, these methods refine proposals individually, ignoring the rich contextual information in the object relationships between the neighbor proposals. In this study, we introduce an object relation module, consisting of a graph generator and a graph neural network (GNN), to learn the spatial information from certain patterns to improve 3D object detection. Specifically, we create an inter-object relationship graph based on proposals in a frame via the graph generator to connect each proposal with its neighbor proposals. Afterward, the GNN module extracts edge features from the generated graph and iteratively refines proposal features with the captured edge features. Ultimately, we leverage the refined features as input to the detection head to obtain detection results. Our approach improves upon the baseline PV-RCNN on the KITTI validation set for the car class across easy, moderate, and hard difficulty levels by 0.82%, 0.74%, and 0.58%, respectively. Additionally, our method outperforms the baseline by more than 1% under the moderate and hard levels BEV AP on the test server.
UR - http://www.scopus.com/inward/record.url?scp=105001672022&partnerID=8YFLogxK
U2 - 10.1109/ITSC58415.2024.10919972
DO - 10.1109/ITSC58415.2024.10919972
M3 - Conference contribution
AN - SCOPUS:105001672022
T3 - IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
SP - 3481
EP - 3488
BT - 2024 IEEE 27th International Conference on Intelligent Transportation Systems, ITSC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 24 September 2024 through 27 September 2024
ER -