S4C: Self-Supervised Semantic Scene Completion with Neural Fields

Adrian Hayler, Felix Wimbauer, Dominik Muhle, Christian Rupprecht, Daniel Cremers

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

Abstract

3D semantic scene understanding is a fundamental challenge in computer vision. It enables mobile agents to autonomously plan and navigate arbitrary environments. SSC formalizes this challenge as jointly estimating dense geometry and semantic information from sparse observations of a scene. Current methods for SSC are generally trained on 3D ground truth based on aggregated LiDAR scans. This process relies on special sensors and annotation by hand which are costly and do not scale well. To overcome this issue, our work presents the first self-supervised approach to SSC called S4C that does not rely on 3D ground truth data. Our proposed method can reconstruct a scene from a single image and only relies on videos and pseudo segmentation ground truth generated from off-the-shelf image segmentation network during training. Unlike existing methods, which use discrete voxel grids, we represent scenes as implicit semantic fields. This formulation allows querying any point within the camera frustum for occupancy and semantic class. Our architecture is trained through rendering-based self-supervised losses. Nonetheless, our method achieves performance close to fully supervised state-of-the-art methods. Additionally, our method demonstrates strong generalization capabilities and can synthesize accurate segmentation maps for far away viewpoints.

OriginalspracheEnglisch
TitelProceedings - 2024 International Conference on 3D Vision, 3DV 2024
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers Inc.
Seiten409-420
Seitenumfang12
ISBN (elektronisch)9798350362459
DOIs
PublikationsstatusVeröffentlicht - 2024
Veranstaltung11th International Conference on 3D Vision, 3DV 2024 - Davos, Schweiz
Dauer: 18 März 202421 März 2024

Publikationsreihe

NameProceedings - 2024 International Conference on 3D Vision, 3DV 2024

Konferenz

Konferenz11th International Conference on 3D Vision, 3DV 2024
Land/GebietSchweiz
OrtDavos
Zeitraum18/03/2421/03/24

Fingerprint

Untersuchen Sie die Forschungsthemen von „S4C: Self-Supervised Semantic Scene Completion with Neural Fields“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren