TY - JOUR
T1 - TOWARDS LARGE-SCALE BUILDING ATTRIBUTE MAPPING USING CROWDSOURCED IMAGES
T2 - 5th Geospatial Week 2023, GSW 2023
AU - Sun, Y.
AU - Kruspe, A.
AU - Meng, L.
AU - Tian, Y.
AU - Hoffmann, E. J.
AU - Auer, S.
AU - Zhu, X. X.
N1 - Publisher Copyright:
© Author(s) 2023.
PY - 2023/12/14
Y1 - 2023/12/14
N2 - Crowdsourced platforms provide huge amounts of street-view images that contain valuable building information. This work addresses the challenges in applying Scene Text Recognition (STR) in crowdsourced street-view images for building attribute mapping. We use Flickr images, particularly examining texts on building facades. A Berlin Flickr dataset is created, and pre-trained STR models are used for text detection and recognition. Manual checking on a subset of STR-recognized images demonstrates high accuracy. We examined the correlation between STR results and building functions, and analysed instances where texts were recognized on residential buildings but not on commercial ones. Further investigation revealed significant challenges associated with this task, including small text regions in street-view images, the absence of ground truth labels, and mismatches in buildings in Flickr images and building footprints in OpenStreetMap (OSM). To develop city-wide mapping beyond urban hotspot locations, we suggest differentiating the scenarios where STR proves effective while developing appropriate algorithms or bringing in additional data for handling other cases. Furthermore, interdisciplinary collaboration should be undertaken to understand the motivation behind building photography and labeling. The STR-on-Flickr results are publicly available at https://github.com/ya0-sun/STR-Berlin.
AB - Crowdsourced platforms provide huge amounts of street-view images that contain valuable building information. This work addresses the challenges in applying Scene Text Recognition (STR) in crowdsourced street-view images for building attribute mapping. We use Flickr images, particularly examining texts on building facades. A Berlin Flickr dataset is created, and pre-trained STR models are used for text detection and recognition. Manual checking on a subset of STR-recognized images demonstrates high accuracy. We examined the correlation between STR results and building functions, and analysed instances where texts were recognized on residential buildings but not on commercial ones. Further investigation revealed significant challenges associated with this task, including small text regions in street-view images, the absence of ground truth labels, and mismatches in buildings in Flickr images and building footprints in OpenStreetMap (OSM). To develop city-wide mapping beyond urban hotspot locations, we suggest differentiating the scenarios where STR proves effective while developing appropriate algorithms or bringing in additional data for handling other cases. Furthermore, interdisciplinary collaboration should be undertaken to understand the motivation behind building photography and labeling. The STR-on-Flickr results are publicly available at https://github.com/ya0-sun/STR-Berlin.
KW - Building Attributes
KW - Building Function
KW - Crowdsource
KW - Flickr
KW - OpenStreetMap (OSM)
KW - Scene Text Recognition (STR)
KW - Street-view Images (SVI)
UR - http://www.scopus.com/inward/record.url?scp=85183288890&partnerID=8YFLogxK
U2 - 10.5194/isprs-archives-XLVIII-1-W2-2023-225-2023
DO - 10.5194/isprs-archives-XLVIII-1-W2-2023-225-2023
M3 - Conference article
AN - SCOPUS:85183288890
SN - 1682-1750
VL - 48
SP - 225
EP - 232
JO - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
JF - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
IS - 1/W2-2023
Y2 - 2 September 2023 through 7 September 2023
ER -