TY - JOUR
T1 - Large-scale building height retrieval from single SAR imagery based on bounding box regression networks
AU - Sun, Yao
AU - Mou, Lichao
AU - Wang, Yuanyuan
AU - Montazeri, Sina
AU - Zhu, Xiao Xiang
N1 - Publisher Copyright:
© 2021 The Authors
PY - 2022/2
Y1 - 2022/2
N2 - Building height retrieval from synthetic aperture radar (SAR) imagery is of great importance for urban applications, yet highly challenging due to the complexity of SAR data. This paper addresses the issue of building height retrieval in large-scale urban areas from a single TerraSAR-X spotlight or stripmap image. Based on the radar viewing geometry, we propose that this problem be formulated as a bounding box regression problem and therefore allows for integrating height data from multiple data sources in generating ground truth on a larger scale. We introduce building footprints from geographic information system (GIS) data as complementary information and propose a bounding box regression network that exploits the location relationship between a building's footprint and its bounding box, enabling fast computation. The method is validated on four urban data sets using TerraSAR-X images in both high-resolution spotlight and stripmap modes. Experimental results show that the proposed network can reduce the computation cost significantly while keeping the height accuracy of individual buildings compared to a Faster R-CNN based method. Moreover, we investigate the impact of inaccurate GIS data on our proposed network, and this study shows that the bounding box regression network is robust against positioning errors in GIS data. The proposed method has great potential to be applied to regional or even global scales. Our code will be made publicly available at github.com/ya0-sun/bbox-SAR-building.
AB - Building height retrieval from synthetic aperture radar (SAR) imagery is of great importance for urban applications, yet highly challenging due to the complexity of SAR data. This paper addresses the issue of building height retrieval in large-scale urban areas from a single TerraSAR-X spotlight or stripmap image. Based on the radar viewing geometry, we propose that this problem be formulated as a bounding box regression problem and therefore allows for integrating height data from multiple data sources in generating ground truth on a larger scale. We introduce building footprints from geographic information system (GIS) data as complementary information and propose a bounding box regression network that exploits the location relationship between a building's footprint and its bounding box, enabling fast computation. The method is validated on four urban data sets using TerraSAR-X images in both high-resolution spotlight and stripmap modes. Experimental results show that the proposed network can reduce the computation cost significantly while keeping the height accuracy of individual buildings compared to a Faster R-CNN based method. Moreover, we investigate the impact of inaccurate GIS data on our proposed network, and this study shows that the bounding box regression network is robust against positioning errors in GIS data. The proposed method has great potential to be applied to regional or even global scales. Our code will be made publicly available at github.com/ya0-sun/bbox-SAR-building.
KW - Bounding box regression
KW - Building height
KW - Deep convolutional neural network (CNN)
KW - Geographic information system (GIS)
KW - Large-scale urban areas
KW - Synthetic aperture radar (SAR)
UR - http://www.scopus.com/inward/record.url?scp=85122009069&partnerID=8YFLogxK
U2 - 10.1016/j.isprsjprs.2021.11.024
DO - 10.1016/j.isprsjprs.2021.11.024
M3 - Article
AN - SCOPUS:85122009069
SN - 0924-2716
VL - 184
SP - 79
EP - 95
JO - ISPRS Journal of Photogrammetry and Remote Sensing
JF - ISPRS Journal of Photogrammetry and Remote Sensing
ER -