TY - GEN
T1 - 3QFP
T2 - 2024 IEEE International Conference on Robotics and Automation, ICRA 2024
AU - Sun, Shuo
AU - Mielle, Malcolm
AU - Lilienthal, Achim J.
AU - Magnusson, Martin
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Neural implicit surface representations are currently receiving a lot of interest as a means to achieve high-fidelity surface reconstruction at a low memory cost, compared to traditional explicit representations. However, state-of-the-art methods still struggle with excessive memory usage and non-smooth surfaces. This is particularly problematic in large-scale applications with sparse inputs, as is common in robotics use cases. To address these issues, we first introduce a sparse structure, tri-quadtrees, which represents the environment using learnable features stored in three planar quadtree projections. Secondly, we concatenate the learnable features with a Fourier feature positional encoding. The combined features are then decoded into signed distance values through a small multilayer perceptron. We demonstrate that this approach facilitates smoother reconstruction with a higher completion ratio with fewer holes. Compared to two recent baselines, one implicit and one explicit, our approach requires only 10%-50% as much memory, while achieving competitive quality. The code is released on https://github.com/ljjTYJR/3QFP.
AB - Neural implicit surface representations are currently receiving a lot of interest as a means to achieve high-fidelity surface reconstruction at a low memory cost, compared to traditional explicit representations. However, state-of-the-art methods still struggle with excessive memory usage and non-smooth surfaces. This is particularly problematic in large-scale applications with sparse inputs, as is common in robotics use cases. To address these issues, we first introduce a sparse structure, tri-quadtrees, which represents the environment using learnable features stored in three planar quadtree projections. Secondly, we concatenate the learnable features with a Fourier feature positional encoding. The combined features are then decoded into signed distance values through a small multilayer perceptron. We demonstrate that this approach facilitates smoother reconstruction with a higher completion ratio with fewer holes. Compared to two recent baselines, one implicit and one explicit, our approach requires only 10%-50% as much memory, while achieving competitive quality. The code is released on https://github.com/ljjTYJR/3QFP.
UR - http://www.scopus.com/inward/record.url?scp=85202450420&partnerID=8YFLogxK
U2 - 10.1109/ICRA57147.2024.10610338
DO - 10.1109/ICRA57147.2024.10610338
M3 - Conference contribution
AN - SCOPUS:85202450420
T3 - Proceedings - IEEE International Conference on Robotics and Automation
SP - 4036
EP - 4044
BT - 2024 IEEE International Conference on Robotics and Automation, ICRA 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 13 May 2024 through 17 May 2024
ER -