TY - GEN
T1 - Classification of settlement types from tweets using LDA and LSTM
AU - Huang, Rong
AU - Taubenböck, Hannes
AU - Mou, Lichao
AU - Zhu, Xiao Xiang
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/10/31
Y1 - 2018/10/31
N2 - Land use reflects the interrelation between the physically built environment and the activity patterns of people. It is indispensable information for decision-makes, but up-to-date and accurate land use information is often absent. Unlike approaches that make use of remote sensing data, in this work, we are interested in a novel data source, tweets, and explore its potential for land use classification in urban areas. Specifically, we propose a general framework for classifying settlement land-use types by extracting location, time, quantity and text features of twitter data. To do so, we apply latent Dirichlet allocation (LDA) and long short-term memory (LSTM) and then combines those features with spatial-temporal feature using Fused SVM and a two-stream convolutional neural network (CNN) for classification. For the case of classifying individual tweets by the land-use classes relevant in this study - residential, non-residential and mixed usage -, we reach overall accuracy (OA), average accuracy (AA), and Kappa coefficient with 72.35%, 73.76%, and 58.43%, respectively. As for the case of classifying block settlement types, we reach 61.90%, 63.33%, and 42.84%, respectively.
AB - Land use reflects the interrelation between the physically built environment and the activity patterns of people. It is indispensable information for decision-makes, but up-to-date and accurate land use information is often absent. Unlike approaches that make use of remote sensing data, in this work, we are interested in a novel data source, tweets, and explore its potential for land use classification in urban areas. Specifically, we propose a general framework for classifying settlement land-use types by extracting location, time, quantity and text features of twitter data. To do so, we apply latent Dirichlet allocation (LDA) and long short-term memory (LSTM) and then combines those features with spatial-temporal feature using Fused SVM and a two-stream convolutional neural network (CNN) for classification. For the case of classifying individual tweets by the land-use classes relevant in this study - residential, non-residential and mixed usage -, we reach overall accuracy (OA), average accuracy (AA), and Kappa coefficient with 72.35%, 73.76%, and 58.43%, respectively. As for the case of classifying block settlement types, we reach 61.90%, 63.33%, and 42.84%, respectively.
KW - Land use classification
KW - Latent dirichlet allocation (LDA)
KW - Long short-term memory (LSTM)
KW - Two-stream convolutional neural network (CNN)
UR - http://www.scopus.com/inward/record.url?scp=85062993884&partnerID=8YFLogxK
U2 - 10.1109/IGARSS.2018.8519240
DO - 10.1109/IGARSS.2018.8519240
M3 - Conference contribution
AN - SCOPUS:85062993884
T3 - International Geoscience and Remote Sensing Symposium (IGARSS)
SP - 6408
EP - 6411
BT - 2018 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2018 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 38th Annual IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2018
Y2 - 22 July 2018 through 27 July 2018
ER -