TY - JOUR
T1 - Co-Enhanced Global-Part Integration for Remote-Sensing Scene Classification
AU - Zhao, Yichen
AU - Chen, Yaxiong
AU - Xiong, Shengwu
AU - Lu, Xiaoqiang
AU - Zhu, Xiao Xiang
AU - Mou, Lichao
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - Remote-sensing (RS) scene classification aims to classify RS images with similar scene characteristics into one category. Plenty of RS images are complex in background, rich in content, and multiscale in target, exhibiting the characteristics of both intraclass separation and interclass convergence. Therefore, discriminative feature representations designed to highlight the differences between classes are the key to RS scene classification. Existing methods represent scene images by extracting either global context or discriminative part features from RS images. However, global-based methods often lack salient details in similar RS scenes, while part-based methods tend to ignore the relationships between local ground objects, thus weakening the discriminative feature representation. In this article, we propose to combine global context and part-level discriminative features within a unified framework called CGINet for accurate RS scene classification. To be specific, we develop a light context-aware attention block (LCAB) to explicitly model the global context to obtain larger receptive fields and contextual information. A co-enhanced loss module (CELM) is also devised to encourage the model to actively locate discriminative parts for feature enhancement. In particular, CELM is only used during training and not activated during inference, which introduces less computational cost. Benefiting from LCAB and CELM, our proposed CGINet improves the discriminability of features, thereby improving classification performance. Comprehensive experiments over four benchmark datasets show that the proposed method achieves consistent performance gains over state-of-the-art (SOTA) RS scene classification methods.
AB - Remote-sensing (RS) scene classification aims to classify RS images with similar scene characteristics into one category. Plenty of RS images are complex in background, rich in content, and multiscale in target, exhibiting the characteristics of both intraclass separation and interclass convergence. Therefore, discriminative feature representations designed to highlight the differences between classes are the key to RS scene classification. Existing methods represent scene images by extracting either global context or discriminative part features from RS images. However, global-based methods often lack salient details in similar RS scenes, while part-based methods tend to ignore the relationships between local ground objects, thus weakening the discriminative feature representation. In this article, we propose to combine global context and part-level discriminative features within a unified framework called CGINet for accurate RS scene classification. To be specific, we develop a light context-aware attention block (LCAB) to explicitly model the global context to obtain larger receptive fields and contextual information. A co-enhanced loss module (CELM) is also devised to encourage the model to actively locate discriminative parts for feature enhancement. In particular, CELM is only used during training and not activated during inference, which introduces less computational cost. Benefiting from LCAB and CELM, our proposed CGINet improves the discriminability of features, thereby improving classification performance. Comprehensive experiments over four benchmark datasets show that the proposed method achieves consistent performance gains over state-of-the-art (SOTA) RS scene classification methods.
KW - Attention
KW - convolutional neural networks (CNNs)
KW - discriminative part discovery
KW - remote-sensing (RS)
KW - scene classification
UR - http://www.scopus.com/inward/record.url?scp=85186093535&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2024.3367877
DO - 10.1109/TGRS.2024.3367877
M3 - Article
AN - SCOPUS:85186093535
SN - 0196-2892
VL - 62
SP - 1
EP - 14
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 4702114
ER -