TY - JOUR
T1 - Buildings detection in VHR SAR images using fully convolution neural networks
AU - Shahzad, Muhammad
AU - Maurer, Michael
AU - Fraundorfer, Friedrich
AU - Wang, Yuanyuan
AU - Zhu, Xiao Xiang
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2019/2
Y1 - 2019/2
N2 - This paper addresses the highly challenging problem of automatically detecting man-made structures especially buildings in very high-resolution (VHR) synthetic aperture radar (SAR) images. In this context, this paper has two major contributions. First, it presents a novel and generic workflow that initially classifies the spaceborne SAR tomography (TomoSAR) point clouds-generated by processing VHR SAR image stacks using advanced interferometric techniques known as TomoSAR-into buildings and nonbuildings with the aid of auxiliary information (i.e., either using openly available 2-D building footprints or adopting an optical image classification scheme) and later back project the extracted building points onto the SAR imaging coordinates to produce automatic large-scale benchmark labeled (buildings/nonbuildings) SAR data sets. Second, these labeled data sets (i.e., building masks) have been utilized to construct and train the state-of-the-art deep fully convolution neural networks with an additional conditional random field represented as a recurrent neural network to detect building regions in a single VHR SAR image. Such a cascaded formation has been successfully employed in computer vision and remote sensing fields for optical image classification but, to our knowledge, has not been applied to SAR images. The results of the building detection are illustrated and validated over a TerraSAR-X VHR spotlight SAR image covering approximately 39 km 2 - A lmost the whole city of Berlin-with the mean pixel accuracies of around 93.84%.
AB - This paper addresses the highly challenging problem of automatically detecting man-made structures especially buildings in very high-resolution (VHR) synthetic aperture radar (SAR) images. In this context, this paper has two major contributions. First, it presents a novel and generic workflow that initially classifies the spaceborne SAR tomography (TomoSAR) point clouds-generated by processing VHR SAR image stacks using advanced interferometric techniques known as TomoSAR-into buildings and nonbuildings with the aid of auxiliary information (i.e., either using openly available 2-D building footprints or adopting an optical image classification scheme) and later back project the extracted building points onto the SAR imaging coordinates to produce automatic large-scale benchmark labeled (buildings/nonbuildings) SAR data sets. Second, these labeled data sets (i.e., building masks) have been utilized to construct and train the state-of-the-art deep fully convolution neural networks with an additional conditional random field represented as a recurrent neural network to detect building regions in a single VHR SAR image. Such a cascaded formation has been successfully employed in computer vision and remote sensing fields for optical image classification but, to our knowledge, has not been applied to SAR images. The results of the building detection are illustrated and validated over a TerraSAR-X VHR spotlight SAR image covering approximately 39 km 2 - A lmost the whole city of Berlin-with the mean pixel accuracies of around 93.84%.
KW - Building detection
KW - OpenStreetMap (OSM)
KW - SAR tomography (TomoSAR)
KW - TerraSAR-X/TanDEM-X
KW - fully convolution neural networks (CNNs)
KW - synthetic aperture radar (SAR)
UR - http://www.scopus.com/inward/record.url?scp=85054625105&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2018.2864716
DO - 10.1109/TGRS.2018.2864716
M3 - Article
AN - SCOPUS:85054625105
SN - 0196-2892
VL - 57
SP - 1100
EP - 1116
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
IS - 2
M1 - 8486983
ER -