Research Article

MC-Net: multi-scale contextual information aggregation network for image captioning on remote sensing images

Pages 4848-4866 | Received 28 Aug 2023, Accepted 08 Nov 2023, Published online: 28 Nov 2023

Figures & data

Figure 1. The overall architecture of MC-Net. The image encoder, with multi-scale feature extraction and a feature aggregation module, extracts the visual features; the description decoder, with a visual-text alignment LSTM, generates the description sentences.

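To ground the architecture sketched in Figure 1, the following is a minimal PyTorch sketch of an encoder-decoder captioner of this shape. Every module name, dimension, and the one-layer placeholder backbone is an illustrative assumption; only the encoder/decoder split and the visual-text alignment step follow the caption above, not the authors' actual implementation.

```python
import torch
import torch.nn as nn

class MultiScaleEncoder(nn.Module):
    """Stand-in for the multi-scale feature extraction + aggregation encoder."""
    def __init__(self, dim=512):
        super().__init__()
        # Placeholder backbone; the paper's encoder is multi-scale (see Figure 2).
        self.backbone = nn.Conv2d(3, dim, kernel_size=16, stride=16)
        self.pool = nn.AdaptiveAvgPool2d(1)

    def forward(self, images):                      # (B, 3, H, W)
        feats = self.backbone(images)               # (B, dim, h, w)
        regions = feats.flatten(2).transpose(1, 2)  # (B, h*w, dim) region features
        global_feat = self.pool(feats).flatten(1)   # (B, dim) global feature
        return regions, global_feat

class AlignLSTMDecoder(nn.Module):
    """Stand-in for the visual-text alignment LSTM decoder."""
    def __init__(self, vocab_size, dim=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.attend = nn.MultiheadAttention(dim, num_heads=8, batch_first=True)
        self.lstm = nn.LSTMCell(2 * dim, dim)
        self.out = nn.Linear(dim, vocab_size)

    def forward(self, regions, global_feat, tokens):   # tokens: (B, T)
        h, c = global_feat, torch.zeros_like(global_feat)
        logits = []
        for t in range(tokens.size(1)):
            word = self.embed(tokens[:, t])
            # Align the current text state with the visual regions.
            ctx, _ = self.attend(h.unsqueeze(1), regions, regions)
            h, c = self.lstm(torch.cat([word, ctx.squeeze(1)], -1), (h, c))
            logits.append(self.out(h))
        return torch.stack(logits, 1)                  # (B, T, vocab)

encoder, decoder = MultiScaleEncoder(), AlignLSTMDecoder(vocab_size=1000)
regions, g = encoder(torch.randn(2, 3, 224, 224))
print(decoder(regions, g, torch.randint(0, 1000, (2, 12))).shape)
```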

Figure 2. Outline of the multi-scale visual feature extraction module.

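A common way to realize multi-scale feature extraction such as the module in Figure 2, shown here as a hedged sketch rather than the paper's implementation, is FPN-style: take features from successive backbone stages, project them to a shared channel width with 1x1 convolutions, resample them to a common grid, and aggregate. All stage sizes below are invented.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiScaleFeatures(nn.Module):
    def __init__(self, dim=256):
        super().__init__()
        self.stage1 = nn.Sequential(nn.Conv2d(3, 64, 3, stride=2, padding=1), nn.ReLU())
        self.stage2 = nn.Sequential(nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU())
        self.stage3 = nn.Sequential(nn.Conv2d(128, 256, 3, stride=2, padding=1), nn.ReLU())
        # 1x1 projections so every scale shares the same channel width.
        self.proj = nn.ModuleList([nn.Conv2d(c, dim, 1) for c in (64, 128, 256)])

    def forward(self, x):
        f1 = self.stage1(x)
        f2 = self.stage2(f1)
        f3 = self.stage3(f2)
        target = f3.shape[-2:]  # resample all scales to the coarsest grid
        scales = [F.adaptive_avg_pool2d(p(f), target)
                  for p, f in zip(self.proj, (f1, f2, f3))]
        return torch.stack(scales).sum(0)  # simple aggregation across scales

feats = MultiScaleFeatures()(torch.randn(1, 3, 224, 224))
print(feats.shape)  # torch.Size([1, 256, 28, 28])
```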

Figure 3. The process of local modeling of images.

Figure 4. The process of global modeling of images.

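Figures 3 and 4 contrast local and global modeling of the image. The sketch below illustrates the usual realization of that contrast: local context via a small (here depthwise) convolution, and global context via self-attention over all spatial positions. It is a generic pattern, not necessarily the paper's exact formulation.

```python
import torch
import torch.nn as nn

dim = 256
feat = torch.randn(2, dim, 14, 14)  # (B, C, H, W) encoder features

# Local modeling: each position mixes only with its 3x3 neighborhood.
local = nn.Conv2d(dim, dim, kernel_size=3, padding=1, groups=dim)  # depthwise
local_out = local(feat)

# Global modeling: every position attends to every other position.
tokens = feat.flatten(2).transpose(1, 2)            # (B, H*W, C)
attn = nn.MultiheadAttention(dim, num_heads=8, batch_first=True)
global_out, _ = attn(tokens, tokens, tokens)        # (B, H*W, C)
print(local_out.shape, global_out.shape)
```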

Table 1. Settings and results of ablation experiments on the UCM-Captions.

Table 2. Settings and results of ablation experiments on the Sydney-Captions.

Table 3. Settings and results of ablation experiments on the RSICD.

Table 4. Settings and results of ablation experiments on the NWPU-Captions.

Table 5. Comparative results of MC-Net on the UCM-Captions.

Table 6. Comparative results of MC-Net on the Sydney-Captions.

Table 7. Comparative results of MC-Net on the RSICD.

Table 8. Comparative results of MC-Net on the NWPU-Captions.

Figure 5. Captioning results of MC-Net on the four datasets. The first to fourth columns show selected examples from UCM-Captions, Sydney-Captions, RSICD, and NWPU-Captions, respectively.

Table 9. Captioning performance comparison with different numbers of attention heads (H) on the UCM-Captions.
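
Table 9 varies the number of attention heads H. For readers unfamiliar with this knob: the embedding dimension is split across heads, so a larger H yields more, narrower attention patterns at essentially the same parameter cost. The embedding dimension of 512 below is an assumption for illustration.

```python
import torch.nn as nn

for H in (1, 2, 4, 8):
    attn = nn.MultiheadAttention(embed_dim=512, num_heads=H)
    params = sum(p.numel() for p in attn.parameters())
    print(f"H={H}: per-head dim={512 // H}, parameters={params}")
# The parameter count is identical for every H; only the head granularity changes.
```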

Table 10. Comparison of our methods in terms of inference speed (images per second), MACs, and parameters. All results are reported on the UCM-Captions.
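
The sketch below shows how numbers like those in Table 10 are typically measured. The model is a stand-in, and MAC counting normally relies on an external profiler such as thop, referenced here only in a comment since it is an extra dependency.

```python
import time
import torch
import torch.nn as nn

model = nn.Sequential(nn.Conv2d(3, 64, 3, padding=1), nn.ReLU(),
                      nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, 10))
model.eval()

# Parameter count: a straight sum over all learnable tensors.
n_params = sum(p.numel() for p in model.parameters())
print(f"parameters: {n_params / 1e6:.2f} M")

# Throughput: average over repeated forward passes on a fixed batch.
x = torch.randn(16, 3, 224, 224)
with torch.no_grad():
    for _ in range(3):                      # warm-up
        model(x)
    start = time.perf_counter()
    for _ in range(10):
        model(x)
    elapsed = time.perf_counter() - start
print(f"images/s: {16 * 10 / elapsed:.1f}")

# MACs (conceptually): from thop import profile; macs, _ = profile(model, (x,))
```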

Table 11. Comparison results with different scales of contextual information on the UCM-Captions.

Data availability statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.