CroCPS: Addressing Photometric Challenges in Self-Supervised Category-Level 6D Object Poses with Cross-Modal Learning

Pengyuan Wang, Lorenzo Garattoni, Sven Meier, Nassir Navab, Benjamin Busam

Publikation: KonferenzbeitragPapierBegutachtung

1 Zitat (Scopus)

Abstract

Estimating 6D object poses for everyday household objects is a crucial and challenging task for robotic applications. Recent advances in category-level object pose estimation show great potential in this direction. Since the training of the networks relies heavily on ground truth 6D poses, which are expensive to annotate in real environments, self-supervised methods become a realistic approach to overcome the domain gap between synthetic and real images. However, these methods work poorly on photometrically-challenging objects because of the missing depth or artifacts in RGBD data. We propose to use the polarization clues to overcome the drawbacks of RGBD images and improve the detection performance for objects with specular surfaces in the self-supervision stage. To this end, we generate a synthetic dataset containing cutlery of various shapes and sizes, and a markerless real dataset with accurate 6D pose annotations. We introduce several novel losses for self-supervision based on inputs of multiple modalities which fully utilize the polarization information. The experiment result shows that the proposed method improves both 2D detection and 3D IoU of the predicted bounding boxes over SOTA methods without usage of annotated ground truth. This work constitutes the first solution for self-supervision on challenging reflective objects and explores the usage of polarization images. We evaluate the effectiveness of the proposed pipeline by proposing synthetic and real data and thorough evaluations.

OriginalspracheEnglisch
PublikationsstatusVeröffentlicht - 2022
Veranstaltung33rd British Machine Vision Conference Proceedings, BMVC 2022 - London, Großbritannien/Vereinigtes Königreich
Dauer: 21 Nov. 202224 Nov. 2022

Konferenz

Konferenz33rd British Machine Vision Conference Proceedings, BMVC 2022
Land/GebietGroßbritannien/Vereinigtes Königreich
OrtLondon
Zeitraum21/11/2224/11/22

Fingerprint

Untersuchen Sie die Forschungsthemen von „CroCPS: Addressing Photometric Challenges in Self-Supervised Category-Level 6D Object Poses with Cross-Modal Learning“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren