497
Views
0
CrossRef citations to date
0
Altmetric
Method

Boundary-enhanced dual-stream network for semantic segmentation of high-resolution remote sensing images

, , , , &
Article: 2356355 | Received 17 Jul 2023, Accepted 13 May 2024, Published online: 01 Jun 2024

References

  • Badrinarayanan, V., A. Kendall, and R. Cipolla. 2017. “SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.” IEEE Transactions on Pattern Analysis and Machine Intelligence 39 (12): 2481–22. https://doi.org/10.1109/TPAMI.2016.2644615.
  • Chen, L. C., J. T. Barron, G. Papandreou, K. Murphy, and A. L. Yuille. 2016. “Semantic Image Segmentation with Task-Specific Edge Detection Using CNNs and a Discriminatively Trained Domain Transform.” 4545–4554. https://doi.org/10.1109/CVPR.2016.492.
  • Cheng, D., G. Meng, S. Xiang, and C. Pan. 2017. “FusionNet: Edge Aware Deep Convolutional Networks for Semantic Segmentation of Remote Sensing Harbor Images.” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 10 (12): 5769–5783. https://doi.org/10.1109/JSTARS.2017.2747599.
  • Chen, L. C., G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille. 2015. “Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs.” 357–361. https://doi.org/10.1080/17476938708814211.
  • Chen, L. C., G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille. 2018. “DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.” IEEE Transactions on Pattern Analysis and Machine Intelligence 40 (4): 834–848. https://doi.org/10.1109/TPAMI.2017.2699184.
  • Chen, L.-C., G. Papandreou, F. Schroff, and H. Adam. 2017. “Rethinking Atrous Convolution for Semantic Image Segmentation.” Arxiv, Dec 05. https://doi.org/10.48550/arXiv.1706.05587.
  • Chen, L. I.-C., Y. Zhu, G. Papandreou, F. Schroff, and H. Adam. 2018. “Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation.” Lecture Notes in Computer Science: 833–851. https://doi.org/10.1007/978-3-030-01234-2_49.
  • Diakogiannis, F. I., F. Waldner, P. Caccetta, and C. Wu. 2020. “ResUnet-A: A Deep Learning Framework for Semantic Segmentation of Remotely Sensed Data.” ISPRS Journal of Photogrammetry and Remote Sensing 162:94–114. https://doi.org/10.1016/j.isprsjprs.2020.01.013.
  • Ding, L., H. Tang, and L. Bruzzone. 2020. “LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images.” IEEE Transactions on Geoscience and Remote Sensing 59 (1): 426–435. https://doi.org/10.1109/TGRS.2020.2994150.
  • Ding, L., J. Zhang, and L. Bruzzone. 2020. “Semantic Segmentation of Large-Size VHR Remote Sensing Images Using a Two-Stage Multiscale Training Architecture.” IEEE Transactions on Geoscience and Remote Sensing 58 (8): 5367–5376. https://doi.org/10.1109/TGRS.2020.2964675.
  • Dong, R., X. Pan, and F. Li. 2019. “DenseU-Net-Based Semantic Segmentation of Small Objects in Urban Remote Sensing Images.” IEEE Access 7:65347–65356. https://doi.org/10.1109/ACCESS.2019.2917952.
  • Du, S., S. Du, B. Liu, and X. Zhang. 2020. “Incorporating DeepLabv3+ and Object-Based Image Analysis for Semantic Segmentation of Very High Resolution Remote Sensing Images.” International Journal of Digital Earth 14 (3): 357–378. https://doi.org/10.1080/17538947.2020.1831087.
  • Engdahl, M. E., and J. M. Hyyppa. 2003. “Land-Cover Classification Using Multitemporal ERS-1/2 InSAR Data.” IEEE Transactions on Geoscience & Remote Sensing 41 (7): 1620–1628. https://doi.org/10.1109/TGRS.2003.813271.
  • Feng, D., C. Haase-Schutz, L. Rosenbaum, H. Hertlein, C. Glaser, F. Timm, W. Wiesbeck, and K. Dietmayer. March, 2021. “Deep Multi-Modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges.” IEEE Transactions on Intelligent Transportation Systems 22 (3): 1341–1360.https://doi.org/10.1109/tits.2020.2972974.
  • Fergus, R., G. W. Taylor, and M. D. Zeiler. 2011. “Adaptive Deconvolutional Networks for Mid and High Level Feature Learning.” 2018–2025. https://doi.org/10.1109/ICCV.2011.6126474.
  • Fu, J., J. Liu, H. Tian, Z. Fang, and H. Lu. 2019. “Dual Attention Network for Scene Segmentation.” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 3141–3149. https://doi.org/10.1109/CVPR.2019.00326.
  • He, K., and J. Sun. 2015. “Convolutional Neural Networks at Constrained Time Cost.” 5353–5360. https://doi.org/10.1109/CVPR.2015.7299173.
  • He, K. M., X. Y. Zhang, S. Q. Ren, J. Sun, and Ieee. 2016. “Deep Residual Learning for Image Recognition.” IEEE Conference on Computer Vision and Pattern Recognition, 770–778. https://doi.org/10.1109/CVPR.2016.90.
  • Huang, G., Z. Liu, V. D. M. Laurens, and K. Q. Weinberger. 2017. “Densely Connected Convolutional Networks.” 2261–2269. https://doi.org/10.1109/CVPR.2017.243.
  • Huang, Z., X. Wang, Y. Wei, L. Huang, H. Shi, W. Liu, and T. S. Huang. June, 2023. “CCNet: Criss-Cross Attention for Semantic Segmentation.” IEEE Transactions on Pattern Analysis & Machine Intelligence 45 (6): 6896–6908. Epub 2023 May. https://doi.org/10.1109/TPAMI.2020.3007032.
  • Huang, J., X. Zhang, Y. Sun, and Q. Xin. 2021. “Attention-Guided Label Refinement Network for Semantic Segmentation of Very High Resolution Aerial Orthoimages.” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 14:4490–4503. https://doi.org/10.1109/JSTARS.2021.3073935.
  • Hu, J., L. Shen, and G. Sun. 2018. “Squeeze-And-Excitation Networks.” 7132–7141. https://doi.org/10.1109/CVPR.2018.00745.
  • ISPRS. 2018. “ISPRS 2D Semantic Labeling Contest.” ISPRS, ed. https://www.isprs.org/education/benchmarks/UrbanSemLab/Default.aspx.
  • Jha, D., P. H. Smedsrud, M. A. Riegler, D. Johansen, T. de Lange, P. Halvorsen. 2019. “ResUnet Plus Plus: An Advanced Architecture for Medical Image Segmentation.” IEEE International Symposium on Multimedia-ISM., 225–230. https://doi.org/10.1109/ISM46123.2019.00049.
  • Jiao, Y. W., X. Wang, W. Wang, and L. Shuang. 2022. “Image Semantic Segmentation Fusion of Edge Detection and AFF Attention Mechanism.” Applied Sciences 12 (21): 11248. https://doi.org/10.3390/app122111248.
  • Jimenez, L. O., J. L. Rivera-Medina, E. Rodriguez-Diaz, E. Arzuaga-Cruz, and M. Ramirez-Velez. 2005. “Integration of Spatial and Spectral Information by Means of Unsupervised Extraction and Classification for Homogenous Objects Applied to Multispectral and Hyperspectral Data.” IEEE Transactions on Geoscience & Remote Sensing 43 (4): 844–851. https://doi.org/10.1109/TGRS.2004.843193.
  • Jin, J., W. Zhou, R. Yang, Y. Lv, and L. Yu. 2023. “Edge Detection Guide Network for Semantic Segmentation of Remote-Sensing Images.” IEEE Geoscience & Remote Sensing Letters 20:1–5. https://doi.org/10.1109/LGRS.2023.3234257.
  • Lateef, F., and Y. Ruichek. 2019, April 21. “Survey on Semantic Segmentation Using Deep Learning Techniques.” Neurocomputing 338:321–348. https://doi.org/10.1016/j.neucom.2019.02.003.
  • Li, R., W. Liu, L. Yang, S. Sun, W. Hu, F. Zhang, and W. Li. 2017. “DeepUnet: A Deep Fully Convolutional Network for Pixel-Level Sea-Land Segmentation.” IEEE Journal of Selected Topics in Applied Earth Observations & Remote Sensing 11 (11): 3954–3962. https://doi.org/10.1109/JSTARS.2018.2833382.
  • Lin, G., A. Milan, C. Shen, and I. Reid. 2017. “RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation.” 5168–5177. https://doi.org/10.1109/CVPR.2017.549.
  • Li, H., K. Qiu, L. Chen, X. Mei, L. Hong, and C. Tao. 2021. “SCAttNet: Semantic Segmentation Network with Spatial and Channel Attention Mechanism for High-Resolution Remote Sensing Images.” IEEE Geoscience and Remote Sensing Letters 18 (5): 905–909. https://doi.org/10.1109/LGRS.2020.2988294.
  • Liu, Y., M. M. Cheng, X. Hu, K. Wang, and X. Bai. 2018. “Richer Convolutional Features for Edge Detection.” IEEE Computer Society 41:1939–1946. https://doi.org/10.1109/TPAMI.2018.2878849.
  • Liu, S., W. Ding, C. Liu, Y. Liu, Y. Wang, and H. Li. 2018. “ERN: Edge Loss Reinforced Semantic Segmentation Network for Remote Sensing Images.” Remote Sensing 10 (9): 1339. https://doi.org/10.3390/rs10091339.
  • Liu, P., X. Liu, M. Liu, Q. Shi, J. Yang, X. Xu, and Y. Zhang. 2019. “Building Footprint Extraction from High-Resolution Images via Spatial Residual Inception Convolutional Neural Network.” Remote Sensing 11 (7): 830. https://doi.org/10.3390/rs11070830.
  • Liu, Y., D. M. Nguyen, N. Deligiannis, W. Ding, and A. Munteanu. June, 2017. “Hourglass-ShapeNetwork Based Semantic Segmentation for High Resolution Aerial Imagery.” Remote Sensing 9 (6): 522. https://doi.org/10.3390/rs9060522.
  • Long, J., E. Shelhamer, and D. Trevor. 2017. “Fully Convolutional Networks for Semantic Segmentation.” IEEE Transactions on Pattern Analysis & Machine Intelligence 39 (4): 640–651. https://doi.org/10.1109/TPAMI.2016.2572683.
  • Lyu, H., H. Fu, X. Hu, and L. Liu. 2019. “Esnet: Edge-Based Segmentation Network for Real-Time Semantic Segmentation in Traffic Scenes.” 1855–1859. https://doi.org/10.1109/ICIP.2019.8803132.
  • Ma, L., M. Li, X. Ma, L. Cheng, P. Du, and Y. Liu. 2017. “A Review of Supervised Object-Based Land-Cover Image Classification.” Isprs Journal of Photogrammetry & Remote Sensing 130 (aug): 277–293. https://doi.org/10.1016/j.isprsjprs.2017.06.001.
  • Marmanis, D., K. Schindler, J. D. Wegner, S. Galliani, M. Datcu, and U. Stilla. 2017. “Classification with an Edge: Improving Semantic Image Segmentation with Boundary Detection.” ISPRS Journal of Photogrammetry and Remote Sensing 135:158–172. https://doi.org/10.1016/j.isprsjprs.2017.11.009.
  • Molchanov, P., S. Tyree, T. Karras, T. Aila, and J. Kautz. 2016. “Pruning Convolutional Neural Networks for Resource Efficient Transfer Learning.” International Conference on Learning Representations (ICLR), 1–17. Toulon, France.
  • Myint, S. W., P. Gober, A. Brazel, S. Grossman-Clarke, and Q. Weng. 2011. “Per-Pixel Vs. Object-Based Classification of Urban Land Cover Extraction Using High Spatial Resolution Imagery.” Remote Sensing of Environment 115 (5): 1145–1161. https://doi.org/10.1016/j.rse.2010.12.017.
  • Niu, R., X. Sun, Y. Tian, W. Diao, K. Chen, and K. Fu. 2021. “Hybrid Multiple Attention Network for Semantic Segmentation in Aerial Images.” IEEE Transactions on Geoscience and Remote Sensing 60:1–18. https://doi.org/10.1109/TGRS.2021.3065112.
  • Nong, Z., X. Su, Y. Liu, Z. Zhan, and Q. Yuan. 2021. “Boundary-Aware Dual Stream Network for VHR Remote Sensing Images Semantic Segmentation.” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 14:5260–6268. https://doi.org/10.1109/JSTARS.2021.3076035.
  • Odena, A., V. Dumoulin, and C. Olah. 2016. “Deconvolution and Checkerboard Artifacts.” Distill 1 (10). https://doi.org/10.23915/distill.00003.
  • Pi, Y., N. D. Nath, and A. H. Behzadan. 2021. “Detection and Semantic Segmentation of Disaster Damage in UAV Footage.” Journal of Computing in Civil Engineering 35 (2): 04020063. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000947.
  • Pohlen, T., A. Hermans, M. Mathias, and B. Leibe. 2016. “Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes.” 3309–3318. https://doi.org/10.1109/CVPR.2017.353.
  • Ronneberger, O., P. Fischer, and T. Brox. 2015. “U-Net: Convolutional Networks for Biomedical Image Segmentation.” Lecture Notes in Computer Science: 234–241. https://doi.org/10.1007/978-3-319-24574-4_28.
  • Samy, M., K. Amer, K. Eissa, M. Shaker, and M. Elhelw. 2018. “NU-Net: Deep Residual Wide Field of View Convolutional Neural Network for Semantic Segmentation.” 267–271. https://doi.org/10.1109/CVPRW.2018.00050.
  • Shivakumar, B. R., and S. V. Rajashekararadhya. 2017. “Performance Evaluation of Spectral Angle Mapper and Spectral Correlation Mapper Classifiers Over Multiple Remote Sensor Data.” 2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT), 1–6. Coimbatore, India. https://doi.org/10.1109/ICECCT.2017.8117946.
  • Simonyan, K., and A. Zisserman. 2014. “Very Deep Convolutional Networks for Large-Scale Image Recognition.”
  • Song, A., and Y. Kim. October, 2020. “Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape Datasets.” ISPRS Journal of Geo-Information 9 (10): 601. https://doi.org/10.3390/ijgi9100601.
  • Szegedy, C., W. Liu, Y. Jia, P. Sermanet, and A. Rabinovich. 2015. “Going Deeper with Convolutions.” 1–9. https://doi.org/10.1109/cvpr.2015.7298594.
  • Venugopal, N. 2020. “Automatic Semantic Segmentation with DeepLab Dilated Learning Network for Change Detection in Remote Sensing Images.” Neural Processing Letters 51 (3): 2355–2377. https://doi.org/10.1007/s11063-019-10174-x.
  • Wang, P., P. Chen, Y. Yuan, D. Liu, and G. Cottrell. 2017. “Understanding Convolution for Semantic Segmentation.”
  • Wang, X., R. Girshick, A. Gupta, K. He, and IEEE. 2018. “Non-Local Neural Networks.” IEEE Conference on Computer Vision and Pattern Recognition, 7794–7803. https://doi.org/10.1109/CVPR.2018.00813.
  • Wang, X., L. Liang, H. Yan, X. Wu, and J. Cai. 2021. “Duplex Restricted Network with Guided Upsampling for the Semantic Segmentation of Remotely Sensed Images.” IEEE Access 6:42438–42448. https://doi.org/10.1109/ACCESS.2021.3065695.
  • Wang, J., K. Sun, T. Cheng, B. Jiang, C. Deng, Y. Zhao, D. Liu, et al. October, 2021. “Deep High-Resolution Representation Learning for Visual Recognition.” IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (10): 3349–3364. https://doi.org/10.1109/TPAMI.2020.2983686.
  • Whiteside, T. G., G. S. Boggs, and S. W. Maier. 2011. “Comparing Object-Based and Pixel-Based Classifications for Mapping Savannas.” International Journal of Applied Earth Observation and Geoinformation 13 (6): 884–893. https://doi.org/10.1016/j.jag.2011.06.008.
  • Woo, S. H., J. Park, J. Y. Lee, and I. S. Kweon. 2018. “CBAM: Convolutional Block Attention Module.” Lecture Notes in Computer Science: 3–19. https://doi.org/10.1007/978-3-030-01234-2_1.
  • Xie, S., Z. Tu, and IEEE. 2015. “Holistically-Nested Edge Detection.” IEEE International Conference on Computer Vision., 1395–1403. https://doi.org/10.1109/ICCV.2015.164.
  • Xu, Z., C. Su, and X. Zhang. 2021. “A Semantic Segmentation Method with Category Boundary for Land Use and Land Cover (LULC) Mapping of Very-High Resolution (VHR) Remote Sensing Image.” International Journal of Remote Sensing 42 (8): 3146–3165. https://doi.org/10.1080/01431161.2020.1871100.
  • Yang, M. D., H. H. Tseng, Y. C. Hsu, and H. P. Tsai. 2020. “Semantic Segmentation Using Deep Learning with Vegetation Indices for Rice Lodging Identification in Multi-Date UAV Visible Images.” Remote Sensing 12 (4): 633. https://doi.org/10.3390/rs12040633.
  • Yi, Y., Z. Zhang, W. Zhang, C. Zhang, and T. Zhao. 2019. “Semantic Segmentation of Urban Buildings from VHR Remote Sensing Imagery Using a Deep Convolutional Neural Network.” Remote Sensing 11 (15): 1774. https://doi.org/10.3390/rs11151774.
  • Yu, C., C. Gao, J. Wang, G. Yu, C. Shen, and N. Sang. 2021. “BiSenet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation.” International Journal of Computer Vision 129 (11): 3051–3068. https://doi.org/10.1007/s11263-021-01515-2.
  • Yu, F., and V. Koltun. 2016. “Multi-Scale Context Aggregation by Dilated Convolutions.” 1–13. https://doi.org/10.48550/arXiv.1511.07122.
  • Yu, C., J. Wang, C. Peng, C. Gao, G. Yu, N. Sang. 2018. “Learning a Discriminative Feature Network for Semantic Segmentation.” IEEE Conference on Computer Vision and Pattern Recognition, 1857–1866. https://doi.org/10.1109/CVPR.2018.00199.
  • Yu, T., W. Wu, C. Gong, and X. Li. 2021. “Residual Multi-Attention Classification Network for a Forest Dominated Tropical Landscape Using High-Resolution Remote Sensing Imagery.” ISPRS International Journal of Geo-Information 10 (1): 22. https://doi.org/10.3390/ijgi10010022.
  • Zhang, J., S. Lin, L. Ding, and L. Bruzzone. 2020. “Multi-Scale Context Aggregation for Semantic Segmentation of Remote Sensing Images.” Remote Sensing 12 (4): 701. https://doi.org/10.3390/rs12040701.
  • Zhao, H., J. Shi, X. Qi, X. Wang, and J. Jia. 2017. “Pyramid Scene Parsing Network.” 6230–6239. https://doi.org/10.1109/CVPR.2017.660.
  • Zheng, X., L. Huan, H. Xiong, and J. Gong. 2019. “ELKPPNet: An Edge-Aware Neural Network with Large Kernel Pyramid Pooling for Learning Discriminative Features in Semantic Segmentation.” https://doi.org/10.48550/arXiv.1906.11428.
  • Zheng, Z., X. Zhang, P. Xiao, and Z. Li. 2021. “Integrating Gate and Attention Modules for High-Resolution Image Semantic Segmentation.” IEEE Journal of Selected Tpics in Applied Earth Observations and Remote Sensing 14:4530–4546. https://doi.org/10.1109/JSTARS.2021.3071353.
  • Zhou, Z., M. M. R. Siddiquee, N. Tajbakhsh, and J. Liang. 2018. “UNet++: A Nested U-Net Architecture for Medical Image Segmentation.” Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision 11045:3–11. https://doi.org/10.1007/978-3-030-00889-5_1.
  • Zuo, R., G. Zhang, R. Zhang, and X. Jia. 2021. “A Deformable Attention Network for High-Resolution Remote Sensing Images Semantic Segmentation.” IEEE Transactions on Geoscience & Remote Sensing 60:1–14. https://doi.org/10.1109/TGRS.2021.3119537.