0
Views
0
CrossRef citations to date
0
Altmetric
Research article

Direction-aware multi-branch attention and Gaussian label assignment for remote sensing aggregative object detection

, , ORCID Icon, &
Pages 5917-5945 | Received 06 May 2024, Accepted 12 Jul 2024, Published online: 02 Aug 2024
 

ABSTRACT

A large number of objects of different scales appear aggregative distribution in remote sensing images, which brings two challenges to remote sensing object detection. The first challenge arises from the presence of objects with large scale differences within the same scene, which complicates the detector’s ability to balance the detection performance between large and small objects. The second challenge is that the distribution of dense objects on remote sensing images with complex backgrounds can reduce the feature representation capability of the network, leading to false detections and missed detections of objects. To address the two challenges present in remote sensing object detection, we propose a new object detection network DA-GLANet (Direction-Aware Multi-Branch Attention and Gaussian Label Assignment Balance Net). For the first challenge, we designed a multi-scale feature fusion module to fully utilize semantic information at different scales, thereby improving the detection accuracy of multi-scale objects. Subsequently, to tackle the second challenge, we proposed a multi-branch attention module with direction-aware capabilities. This module can enhance the network’s feature representation of objects of interest by generating attention maps that are both direction-aware and position-sensitive. In addition, we propose a label assignment strategy based on Gaussian distribution to address the imbalance of positive and negative labels caused by the complexity of remote sensing images. These methods work synergistically to improve aggregative object detection accuracy. We have conducted extensive experiments on the DOTA dataset, Levir dataset, and WHU buildings change detection dataset to demonstrate the superiority of our method. On the DOTA dataset, our method improves mAP by 0.4%, 0.3%, and 0.2% compared to the top three methods. On the Levir dataset, our method improves mAP by 2.5%, 2.2%, and 1.6% compared to the top three methods.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the National Key Research and Development Program of China [Grant No. 2022YFC3005703], the National Natural Science Foundation of China [41771451], the Natural Science Foundation of Sichuan, China[2022NSFSC0409].

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 689.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.