Matterport3D: Learning from RGB-D data in indoor environments

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

922 Zitate (Scopus)

Abstract

Access to large, diverse RGB-D datasets is critical for training RGB-D scene understanding algorithms. However, existing datasets still cover only a limited number of views or a restricted scale of spaces. In this paper, we introduce Matterport3D, a large-scale RGB-D dataset containing 10,800 panoramic views from 194,400 RGB-D images of 90 building-scale scenes. Annotations are provided with surface reconstructions, camera poses, and 2D and 3D semantic segmentations. The precise global alignment and comprehensive, diverse panoramic set of views over entire buildings enable a variety of supervised and self-supervised computer vision tasks, including keypoint matching, view overlap prediction, normal prediction from color, semantic segmentation, and region classification.

OriginalspracheEnglisch
TitelProceedings - 2017 International Conference on 3D Vision, 3DV 2017
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers Inc.
Seiten667-676
Seitenumfang10
ISBN (elektronisch)9781538626108
DOIs
PublikationsstatusVeröffentlicht - 25 Mai 2018
Veranstaltung7th IEEE International Conference on 3D Vision, 3DV 2017 - Qingdao, China
Dauer: 10 Okt. 201712 Okt. 2017

Publikationsreihe

NameProceedings - 2017 International Conference on 3D Vision, 3DV 2017

Konferenz

Konferenz7th IEEE International Conference on 3D Vision, 3DV 2017
Land/GebietChina
OrtQingdao
Zeitraum10/10/1712/10/17

Fingerprint

Untersuchen Sie die Forschungsthemen von „Matterport3D: Learning from RGB-D data in indoor environments“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren