TY - GEN
T1 - Incremental dense semantic stereo fusion for large-scale semantic scene reconstruction
AU - Vineet, Vibhav
AU - Miksik, Ondrej
AU - Lidegaard, Morten
AU - Nießner, Matthias
AU - Golodetz, Stuart
AU - Prisacariu, Victor A.
AU - Kähler, Olaf
AU - Murray, David W.
AU - Izadi, Shahram
AU - Pérez, Patrick
AU - Torr, Philip H.S.
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/6/29
Y1 - 2015/6/29
N2 - Our abilities in scene understanding, which allow us to perceive the 3D structure of our surroundings and intuitively recognise the objects we see, are things that we largely take for granted, but for robots, the task of understanding large scenes quickly remains extremely challenging. Recently, scene understanding approaches based on 3D reconstruction and semantic segmentation have become popular, but existing methods either do not scale, fail outdoors, provide only sparse reconstructions or are rather slow. In this paper, we build on a recent hash-based technique for large-scale fusion and an efficient mean-field inference algorithm for densely-connected CRFs to present what to our knowledge is the first system that can perform dense, large-scale, outdoor semantic reconstruction of a scene in (near) real time. We also present a 'semantic fusion' approach that allows us to handle dynamic objects more effectively than previous approaches. We demonstrate the effectiveness of our approach on the KITTI dataset, and provide qualitative and quantitative results showing high-quality dense reconstruction and labelling of a number of scenes.
AB - Our abilities in scene understanding, which allow us to perceive the 3D structure of our surroundings and intuitively recognise the objects we see, are things that we largely take for granted, but for robots, the task of understanding large scenes quickly remains extremely challenging. Recently, scene understanding approaches based on 3D reconstruction and semantic segmentation have become popular, but existing methods either do not scale, fail outdoors, provide only sparse reconstructions or are rather slow. In this paper, we build on a recent hash-based technique for large-scale fusion and an efficient mean-field inference algorithm for densely-connected CRFs to present what to our knowledge is the first system that can perform dense, large-scale, outdoor semantic reconstruction of a scene in (near) real time. We also present a 'semantic fusion' approach that allows us to handle dynamic objects more effectively than previous approaches. We demonstrate the effectiveness of our approach on the KITTI dataset, and provide qualitative and quantitative results showing high-quality dense reconstruction and labelling of a number of scenes.
UR - http://www.scopus.com/inward/record.url?scp=84938253380&partnerID=8YFLogxK
U2 - 10.1109/ICRA.2015.7138983
DO - 10.1109/ICRA.2015.7138983
M3 - Conference contribution
AN - SCOPUS:84938253380
T3 - Proceedings - IEEE International Conference on Robotics and Automation
SP - 75
EP - 82
BT - 2015 IEEE International Conference on Robotics and Automation, ICRA 2015
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2015 IEEE International Conference on Robotics and Automation, ICRA 2015
Y2 - 26 May 2015 through 30 May 2015
ER -