Abstract
Dense image alignment from RGB-D images remains a critical issue for real-world applications, especially under challenging lighting conditions and in a wide baseline setting. In this letter, we propose a new framework to learn a pixel-wise deep feature map and a deep feature-metric uncertainty map predicted by a Convolutional Neural Network (CNN), which together formulate a deep probabilistic feature-metric residual of the two-view constraint that can be minimised using Gauss-Newton in a coarse-to-fine optimisation framework. Furthermore, our network predicts a deep initial pose for faster and more reliable convergence. The optimisation steps are differentiable and unrolled to train in an end-to-end fashion. Due to its probabilistic essence, our approach can easily couple with other residuals, where we show a combination with ICP. Experimental results demonstrate state-of-the-art performances on the TUM RGB-D dataset and the 3D rigid object tracking dataset. We further demonstrate our method's robustness and convergence qualitatively.
| Original language | English |
|---|---|
| Article number | 9264717 |
| Pages (from-to) | 223-230 |
| Number of pages | 8 |
| Journal | IEEE Robotics and Automation Letters |
| Volume | 6 |
| Issue number | 1 |
| DOIs | |
| State | Published - Jan 2021 |
| Externally published | Yes |
Keywords
- Deep learning for visual perception
- SLAM
Fingerprint
Dive into the research topics of 'Deep Probabilistic Feature-Metric Tracking'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver