Abstract
Estimating 6D object poses for everyday household objects is a crucial and challenging task for robotic applications. Recent advances in category-level object pose estimation show great potential in this direction. Since the training of the networks relies heavily on ground truth 6D poses, which are expensive to annotate in real environments, self-supervised methods become a realistic approach to overcome the domain gap between synthetic and real images. However, these methods work poorly on photometrically-challenging objects because of the missing depth or artifacts in RGBD data. We propose to use the polarization clues to overcome the drawbacks of RGBD images and improve the detection performance for objects with specular surfaces in the self-supervision stage. To this end, we generate a synthetic dataset containing cutlery of various shapes and sizes, and a markerless real dataset with accurate 6D pose annotations. We introduce several novel losses for self-supervision based on inputs of multiple modalities which fully utilize the polarization information. The experiment result shows that the proposed method improves both 2D detection and 3D IoU of the predicted bounding boxes over SOTA methods without usage of annotated ground truth. This work constitutes the first solution for self-supervision on challenging reflective objects and explores the usage of polarization images. We evaluate the effectiveness of the proposed pipeline by proposing synthetic and real data and thorough evaluations.
Originalsprache | Englisch |
---|---|
Publikationsstatus | Veröffentlicht - 2022 |
Veranstaltung | 33rd British Machine Vision Conference Proceedings, BMVC 2022 - London, Großbritannien/Vereinigtes Königreich Dauer: 21 Nov. 2022 → 24 Nov. 2022 |
Konferenz
Konferenz | 33rd British Machine Vision Conference Proceedings, BMVC 2022 |
---|---|
Land/Gebiet | Großbritannien/Vereinigtes Königreich |
Ort | London |
Zeitraum | 21/11/22 → 24/11/22 |