160
Views
0
CrossRef citations to date
0
Altmetric
Research Article

SwinHCST: a deep learning network architecture for scene classification of remote sensing images based on improved CNN and Transformer

, , , , , , , & show all
Pages 7439-7463 | Received 06 Jul 2023, Accepted 08 Nov 2023, Published online: 05 Dec 2023
 

ABSTRACT

Remote sensing image scene classification is a fundamental task in intelligent interpretation of remote sensing images. Although Transformers possess a powerful attention mechanism, they require lengthy training procedures to achieve good performance levels. To address this issue, this paper proposes a novel deep learning network model by combining CNN and Swin Transformer named SwinHCST. Firstly, the model uses Weighted Normalized CNN to quickly extract low-level features of the image. Secondly, the Receptive Field Block module facilitates multi-scale information fusion, Thirdly, the Information Fusion Transformer further excavates the deep-level features of the image. Furthermore, this paper has designed a plug-and-play Cross Spatial Information Fusion Block, which is used to encodes dimensional information and extracts global information to enhance information exchange. The scene classification experiments show that the proposed model outperforms other methods on the three selected datasets and can achieve excellent performance without requiring large amounts of data and training. Specifically, the classification accuracy of the proposed method on the three datasets is 93.76%, 93.60%, and 98.10%, which is 1.7% to 3.71% higher than ResNet50 and 3.7% to 5.7% higher than Swin Transformer.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The RESISC45 dataset, AID dataset and UC Merced dataset used in this work belong to open-source dataset available in their corresponding references within this manuscript. The RESISC45 dataset is available for download at https://pan.baidu.com/s/1mifR6tU. The AID dataset is available for download at [https://pan.baidu.com/s/1mifOBv6. The UC Merced dataset is available for download at http://weegee.vision.ucmerced.edu/datasets/landuse.html.

Additional information

Funding

This work was supported by the [The Fundamental Research Funds for the Central Universities #1] under Grant [number 2572017CB13]; [Heilongjiang Provincial Natural Science Foundation of China #2] under Grant [number QC2016080];

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.