Camera-LiDAR Inconsistency Analysis for Active Learning in Object Detection

Esteban Rivera, Ana Clara Serra Do Nascimento, Markus Lienkamp

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

Today, deep learning detectors for autonomous driving are delivering impressive results on public datasets and in real-world applications. However, these detectors require large amounts of data, especially labeled data, to achieve the performance needed to ensure safe driving. The process of collecting and tagging data is expensive and cumbersome. Therefore, the recent focus of the industry has been on how to achieve similar performance while limiting the amount of labeled data required to train such models. Within the cross-modal active learning paradigm, we propose and analyze new strategies to exploit the inconsistencies between camera and LiDAR detectors to improve sampling efficiency and label only the samples that promise improvements for model training. For this, we leverage the 2D projection of the bounding boxes to equalize the output quality of camera and LiDAR detections. Finally, we achieve up to 0.6% AP improvement for camera and 2% improvement for LiDAR over random sampling on the KITTI dataset using a sampling strategy based on the number of detected objects.
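
The abstract names a sampling strategy based on the number of detected objects but gives no formula. As a rough illustration only, the sketch below assumes the inconsistency score of a frame is the absolute difference between the camera detection count and the count of LiDAR detections already projected to 2D image boxes, and that the highest-scoring frames are sent for labeling. The function names, the score definition, and the tie-breaking rule are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Sequence, Tuple

# (x_min, y_min, x_max, y_max) in image pixels; LiDAR boxes are assumed
# to have been projected into the image plane beforehand.
Box2D = Tuple[float, float, float, float]


def count_inconsistency(camera_boxes: Sequence[Box2D],
                        lidar_boxes_2d: Sequence[Box2D]) -> int:
    """Hypothetical score: how much the two detectors disagree on object count."""
    return abs(len(camera_boxes) - len(lidar_boxes_2d))


def select_for_labeling(camera_dets: Dict[str, List[Box2D]],
                        lidar_dets_2d: Dict[str, List[Box2D]],
                        budget: int) -> List[str]:
    """Return up to `budget` frame ids with the largest count disagreement."""
    scores = {
        frame_id: count_inconsistency(boxes, lidar_dets_2d.get(frame_id, []))
        for frame_id, boxes in camera_dets.items()
    }
    # Sort by descending score; break ties by frame id for reproducibility.
    ranked = sorted(scores, key=lambda frame_id: (-scores[frame_id], frame_id))
    return ranked[:budget]


if __name__ == "__main__":
    box = (0.0, 0.0, 10.0, 10.0)
    camera = {"frame_a": [box, box, box], "frame_b": [box], "frame_c": []}
    lidar = {"frame_a": [box], "frame_b": [box], "frame_c": [box]}
    print(select_for_labeling(camera, lidar, budget=2))
```

In an active learning loop, the selected frames would be annotated and added to the training set before the detectors are retrained. The strategies actually evaluated in the paper may weight or combine the detections differently.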

Original language: English
Title of host publication: 35th IEEE Intelligent Vehicles Symposium, IV 2024
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 97-103
Number of pages: 7
ISBN (Electronic): 9798350348811
State: Published - 2024
Event: 35th IEEE Intelligent Vehicles Symposium, IV 2024 - Jeju Island, Korea, Republic of
Duration: 2 Jun 2024 to 5 Jun 2024

Publication series

Name: IEEE Intelligent Vehicles Symposium, Proceedings
ISSN (Print): 1931-0587
ISSN (Electronic): 2642-7214

Conference

Conference: 35th IEEE Intelligent Vehicles Symposium, IV 2024
Country/Territory: Korea, Republic of
City: Jeju Island
Period: 2/06/24 to 5/06/24

Keywords

  • active learning
  • autonomous driving
  • data efficiency
  • multimodality
  • object detection
