64
Views
0
CrossRef citations to date
0
Altmetric
Research Article

An optimization method of multiscale storage tank target detection introducing an attention mechanism

, , &
Article: 2339304 | Received 04 Jan 2024, Accepted 01 Apr 2024, Published online: 31 May 2024

References

  • Bochkovskiy A, Wang CY, Liao HY Mark. 2020. YOLOv4: optimal speed and accuracy of object detection. https://arxiv.org/pdf/2004.10934.pdf
  • Chen D, Miao DQ. 2023. Control distance IOU and control distance IOU loss for better bounding box regression. Pattern Recognit. 137. doi: 10.1016/j.patcog.2022.109256.
  • Du SJ, Zhang BF, Zhang P, Xiang P. 2021. An improved bounding box regression loss function based on CIOU loss for multi-scale object detection. 2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML); Chengdu, China. p. 92–98. doi: 10.1109/PRML52754.2021.9520717.
  • Duda RO, Hart PE. 1972. Use of the Hough transformation to detect lines and curves in pictures. Commun ACM. 15(1):11–15. doi: 10.1145/361237.361242.
  • Girshick R. 2015. Fast R-CNN. 2015 IEEE International Conference on Computer Vision (ICCV); Santiago, Chile. p.1440–1448. doi: 10.1109/ICCV.2015.169.
  • Girshick R, Donahue J,Darrell T, Malik J. 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. 2014 IEEE Conference on Computer Vision and Pattern Recognition; Columbus, OH, United states. p. 580–587. doi: 10.1109/CVPR.2014.81.
  • Hao ZH, Zhang XY, Chen ZC, Lu KX. 2022. Port storage tanks of target extraction based on convolution neural network. J Beijing Univ Information Sci Technol (Nat Sci Ed). 37(2):8–14. doi: 10.16508/j.carolcarrollnki.11-5866/n.2022.02.002.
  • He KM, Gkioxari G, Dollár P, Girshick R. 2020. Mask R-CNN. IEEE Trans Pattern Anal Mach Intell. 42(2):386–397. doi: 10.1109/TPAMI.2018.2844175.
  • He KM, Zhang XY, Ren SQ, Sun J. 2015. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell. 37(9):1904–1916. doi: 10.1109/TPAMI.2015.2389824.
  • Hou QB, Zhou DQ, Feng JS. 2021. Coordinate attention for efficient mobile network design. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); Nashville, TN, USA. p. 13708–13717. doi: 10.1109/CVPR46437.2021.01350.
  • Jiang B, Chen S, Wang B, Luo B. 2022. MGLNN: semi-supervised learning via multiple graph cooperative learning neural networks. Neural Netw. 153:204–214. doi: 10.1016/j.neunet.2022.05.024.
  • Jiang YF. 2020. Research on building extraction method of high-resolution remote sensing image based on Convolutional neural network [D]. Jiaozuo: Henan University of Science and Technology. doi: 10.27116/dcnki.GJZGC.2020.000185.
  • Li X, Te R, Yi F, Xu G. 2023. CTS-YOLO model for global oil storage tank inspection. Optics and Precision Engin. 31(2):246–262. doi: 10.37188/OPE.20233102.0246.
  • Lin TY, Dollár P, Girshick R, He KM, Hariharan B, Belongie S. 2017. Feature pyramid networks for object detection. 2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR 2017); Honolulu, HI, United states. p. 936–944. doi: 10.1109/CVPR.2017.106.
  • Liu S, Qi L, Qin HF, Shi JP, Jia JY. 2018. Path aggregation network for instance segmentation. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition; Salt Lake City, UT, USA. p. 8759–8768. doi: 10.1109/CVPR.2018.00913.
  • Liu W, Anguelov D, Erhan D, Szegedy C, Reed SE, Fu CY, Berg A. 2016. SSD: singleshot multibox detector. European Conference on Computer Vision (ECCV 2016); Amsterdam, The Netherlands. p. 21–37. doi: 10.1007/978-3-319-46448-0_2.
  • Ma W. 2005. Oil depot recognition in remote sensing image [D]. Nanjing: Nanjing University of Science and Technology.
  • Redmon J, Divvala S, Girshick R, Farhadi A. 2016. You only look once: unified, real-time object detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); Las Vegas, NV, USA. p. 779–788. doi: 10.1109/CVPR.2016.91.
  • Redmon J, Farhadi A. 2017. YOLO9000: better, faster, stronger. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); Honolulu, HI, USA. p. 6517–6525. doi: 10.1109/CVPR.2017.690.
  • Redmon J, Farhadi A. 2018. YOLOv3: an incremental improvement. arXiv:1804.02767. doi: 10.48550/arXiv.1804.02767.
  • Ren SQ, He KM, Girshick R, Sun J. 2017. Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence. 39(6):1137–1149. doi: 10.1109/TPAMI.2016.2577031.
  • Rezatofighi H, Tsoi N, Gwak J,Sadeghian A, Reid I, Savarese S. 2019. Generalized intersection over union: A metric and a loss for bounding box regression. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); Long Beach, CA, USA. p. 658–666. doi: 10.1109/CVPR.2019.00075.
  • Roy AM, Bhaduri J, Kumar T, Raj K. 2023. WilDect-YOLO: an efficient and robust computer vision-based accurate object localization model for automated endangered wildlife detection. Ecol Inf. 75:101919. doi: 10.1016/j.ecoinf.2022.101919.
  • Wang CY, Liao HY Mark, Wu YH, Chen PY, Hsieh JW, Yeh IH. 2020. CSPNet: A New Backbone that can Enhance Learning Capability of CNN. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2020); Seattle, WA, USA. p.1571–1580. doi: 10.1109/CVPRW50498.2020.00203.
  • Wang JJ, Li W, Gao YH, Zhang MM, Tao R, Du Q. 2023. Hyperspectral and SAR image classification via multiscale interactive fusion network. IEEE Transact Neural Networks and Learning Syst. 34(12):10823–10837. doi: 10.1109/TNNLS.2022.3171572.
  • Zhang MM, Li W, Zhao XD, Liu H, Tao R, Du Q. 2023. Morphological transformation and spatial-logical aggregation for tree species classification using hyperspectral imagery. IEEE Trans Geosci Remote Sens. 61. doi: 10.1109/TGRS.2022.3233847.
  • Zhang YF, Ren WQ, Zhang Z, Jia Z, Wang L, Tai T. 2022. Focal and efficient IOU loss for accurate bounding box regression. Neurocomputing. 506:146–157. doi: 10.1016/j.neucom.2022.07.042.
  • Zhao W, Yang HP, Shen ZF, Luo JC. 2015. Oil tanks extraction from high resolution imagery using a directional and weighted hough voting method. J Indian Soc Remote Sens. 43(3):539–549. doi: 10.1007/s12524-014-0440-2.
  • Zhu CX, Liu B, Zhou YH, Yu QZ, Liu XZ, Yu WX. 2012. Framework design and implementation for oil tank detection in optical satellite imagery. 2012 IEEE International Geoscience and Remote Sensing Symposium; Munich, Germany. p. 6016–6019. doi: 10.1109/IGARSS.2012.6352236.