Publication Cover
Canadian Journal of Remote Sensing
Journal canadien de télédétection
Volume 49, 2023 - Issue 1
331
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Multiscale Cascaded Network for the Semantic Segmentation of High-Resolution Remote Sensing Images

Réseau multi-échelle en cascade pour la segmentation sémantique d’images de télédétection à haute résolution

, &
Article: 2255068 | Received 09 Aug 2022, Accepted 11 Aug 2023, Published online: 11 Sep 2023

References

  • Badrinarayanan, V., Kendall, A., and Cipolla, R. 2017. “SegNet: A deep convolutional encoder-decoder architecture for image segmentation.” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.39(No.12):pp. 2481–2495. doi:10.1109/TPAMI.2016.2644615.
  • Chen, L. C., Papandreou, G., Schroff, F., Adam, H. 2017. “Rethinking atrous convolution for semantic image segmentation.” arXiv preprint arXiv: 1706. 05587.
  • Chen, L. C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H. 2018b. “Encoder-decoder with atrous separable convolution for semantic image segmentation.” In Proceedings of the European Conference on Computer Vision (ECCV), pp. 801–818.
  • Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., and Yuille, A.L. 2018a. “DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs.” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.40(No.4): pp. 834–848. doi:10.1109/TPAMI.2017.2699184.
  • Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., and Yuille, A.L. 2014. “Semantic image segmentation with deep convolutional nets and fully connected CRFs.” arXiv preprint arXiv: 1412. 7062.
  • Chen, S., Zuo, Q., and Wang, Z. 2021. “Semantic segmentation of high resolution remote sensing images based on improved ResU-Net.” In International Conference of Pioneering Computer Scientists, pp. 303–313.
  • Demir, I., Koperski, K., Lindenbaum, D., Pang, G., Huang, J., Basu, S., Hughes, F., et al. 2018. “DeepGlobe 2018: A challenge to parse the earth through satellite images.” arXiv preprint arXiv:1805.06561.
  • Hu, J., Shen, L., and Sun, G. 2018. “Squeeze-and-excitation networks.” In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 7132–7141.
  • Li, X., Lei, L., and Kuang, G. 2022. “Multilevel adaptive-scale context aggregating network for semantic segmentation in high-resolution remote sensing images.” IEEE Geoscience and Remote Sensing Letters, Vol.19: pp. 1–5. doi:10.1109/LGRS.2021.3091284.
  • Long, J., Shelhamer, E., and Darrell, T. 2015. “Fully convolutional networks for semantic segmentation.” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recongnition, Boston, MA: IEEE Press, pp. 3431–3440.
  • Mehta, N., and Murala, S. 2021. “MSAR-net: Multi-scale attention based light-weight image super-resolution.” Pattern Recognition Letters, Vol. 151: pp. 215–221. doi:10.1016/j.patrec.2021.08.011.
  • Peng, C., Zhang, K., Ma, Y., and Ma, J. 2022. “Cross fusion net: A fast semantic segmentation network for small-scale semantic information capturing in aerial scenes.” IEEE Transactions on Geoscience and Remote Sensing, Vol. 60: pp. 1–13. doi:10.1109/TGRS.2021.3053062.
  • Qi, X., Li, K., Liu, P., Zhou, X., and Sun, M. 2020. “Deep attention and multi-scale networks for accurate remote sensing image segmentation.” IEEE Access, Vol. 8: pp. 146627–146639. doi:10.1109/ACCESS.2020.3015587.
  • Ronneberger, O., Fischer, P., and Brox, T. 2015. “U-net: Convolutional networks for biomedical image segmentation.” In International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer, pp. 234–241.
  • Rottensteiner, F., Sohn, G., Jung, J., Gerke, M., Baillard, C., Benitez, S., Breitkopf, U., et al. 2012. “The ISPRS benchmark on urban object classification and 3D building reconstruction.” ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol.I–3: pp. 293–298. doi:10.5194/isprsannals-I-3-293-2012.
  • Shang, R., Zhang, J., Jiao, L., Li, Y., Marturi, N., and Stolkin, R. 2020. “Multi-scale adaptive feature fusion network for semantic segmentation in remote sensing images.” Remote Sensing, Vol.12(No. 5): p. 872. doi:10.3390/rs12050872.
  • Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., and Wojna, Z. 2016. “Rethinking the inception architecture for computer vision.” In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, pp. 2818–2826. doi:10.1109/CVPR.2016.308.
  • Tao, A., Sapra, K., and Catanzaro, B. 2020. “Hierarchical multi-scale attention for semantic segmentation.” doi:10.48550/arXiv.2005.10821.
  • Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., and Hu, Q. 2020. “ECA-net: Efficient channel attention for deep convolutional neural networks.” In Proceedings of the CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, pp. 11531–11539.
  • Xiang, X. 2021. Research on remote sensing image semantic segmentation based on deep learning. Guiyang: Guizhou University. doi:10.27047/d.cnki.ggudu.2021.002806.
  • Xiao, T., Liu, Y., Zhou, B., Jiang, Y., and Sun, J. 2018. “Unified perceptual parsing for scene understanding.” In Proceedings of the European Conference on Computer Vision (ECCV), pp. 418–434.
  • Yang, M., Yu, K., Zhang, C., Li, Z., and Yang, K. 2018. “DenseASPP for semantic segmentation in street scenes.” In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3684–3692.
  • Zhao, B., Zhang, X., Zhan, Z., and Wu, Q. 2021. “Deep multi-scale separable convolutional network with triple attention mechanism: A novel multi-task domain adaptation method for intelligent fault diagnosis.” Expert Systems with Applications, Vol.182: pp. 115087. doi:10.1016/j.eswa.2021.115087.
  • Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. 2017. “Pyramid scene parsing network.” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2881–2890.
  • Zhao, J., Zhou, Y., Shi, B., Yang, J., Zhang, D., and Yao, R. 2021. “Multi-stage fusion and multi-source attention network for multi-modal remote sensing image segmentation.” ACM Transactions on Intelligent Systems and Technology, Vol.12(No. 6): pp. 1–20. doi:10.1145/3484440.