1,538
Views
0
CrossRef citations to date
0
Altmetric
Earth Observations

GPU based building footprint identification utilising self-attention multiresolution analysis

, &
Pages 102-111 | Received 09 Jan 2023, Accepted 11 Apr 2023, Published online: 27 Apr 2023

References

  • Azimi, S. M., Fischer, P., Korner, M., & Reinartz, P. (2019, May). Aerial LaneNet: Lane-marking semantic segmentation in aerial imagery using waveletenhanced cost-sensitive symmetric fully convolutional neural networks. IEEE Transactions on Geoscience and Remote Sensing: A Publication of the IEEE Geoscience and Remote Sensing Society, 57(5), 2920–2938. https://doi.org/10.1109/TGRS.2018.2878510
  • Cao, R., Fang, L., Lu, T., & He, N. (2021, Jan). Self-attention-based deep feature fusion for remote sensing scene classification. IEEE Geoscience and Remote Sensing Letters, 18(1), 43–47. https://doi.org/10.1109/LGRS.2020.2968550
  • Chen, J., Jiang, Y., Luo, L., & Gong, W. (2022). ASF-Net: Adaptive screening feature network for building footprint extraction from remote-sensing images. IEEE Transactions on Geoscience and Remote Sensing, 60, 1–13. https://doi.org/10.1109/TGRS.2022.3165204
  • Dai, C. D., & Niessner, M. (2020). SG-NN: Sparse generative neural networks for self-supervised scene completion of RGB-D scans. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 846–855. https://doi.org/10.1109/CVPR42600.2020.00093.
  • Fu, J. Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z. and Lu, H. (2019). Dual attention network for scene segmentation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 3141–3149. https://doi.org/10.1109/CVPR.2019.00326.
  • GFDRR Labs. (2020). Open cities AI challenge dataset”, version 1.0, 20 Dec. 2021. Radiant MLHub. https://doi.org/10.34911/rdnt.f94cxb
  • Ji, S., Wei, S., & Lu, M. (2019, Jan). Fully convolutional networks for multisource building extraction from an open aerial and satellite imagery data set. IEEE Transactions on Geoscience and Remote Sensing, 57(1), 574–586. https://doi.org/10.1109/TGRS.2018.2858817
  • Kaiming, H., Zhang, X., Ren, S. and Sun, J. (2016). Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, USA.
  • Kang, J., Fernandez-Beltran, R., Sun, X., Jingen, N., & Plaza, A. (2021). Deep learning-based building footprint extraction with missing annotations. IEEE Geoscience and Remote Sensing Letters, 19, 1–5. https://doi.org/10.1109/LGRS.2021.3072589
  • Libo, W., Rui, L., Zhang, C., Fang, S., Duan, C., Meng, X., & Atkinson, P. M. (2022). UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery. ISPRS Journal of Photogrammetry and Remote Sensing, 190, 196–214. https://doi.org/10.1016/j.isprsjprs.2022.06.008
  • Li, R., Duan, C., Zheng, S., Zhang, C., & Atkinson, P. M. (2021). MACU-Net for semantic segmentation of fine-resolution remotely sensed images. IEEE Geoscience and Remote Sensing Letters, 1, 1–5. https://doi.org/10.1109/LGRS.2021.3052886
  • Li Liu, Y., Yin, H., Li, Y., Guo, Q., Zhang, L. and Du, P. (2021). Attention residual U-Net for building segmentation in aerial images. IEEE International Geoscience and Remote Sensing Symposium IGARSS, 4047–4050. https://doi.org/10.1109/IGARSS47720.2021.9554058.
  • Lin, T.Y., Goyal, P., Girshick, R., He, K., & Dollár, P. (2017). Focal loss for dense object detection. Proceedings of the IEEE international conference on computer vision, Venice, Italy.
  • Li, H., Qiu, K., Chen, L., Mei, X., Hong, L., & Tao, C. (2021, May). SCAttNet: Semantic segmentation network with spatial and channel attention mechanism for high-resolution remote sensing images. IEEE Geoscience and Remote Sensing Letters, 18(5), 905–909. https://doi.org/10.1109/LGRS.2020.2988294
  • Liu, Y., Yao, J., Lu, X., Xia, M., Wang, X., & Liu, Y. (2019, Apr). RoadNet: Learning to comprehensively analyze road networks in complex urban scenes from high-resolution remotely sensed images. IEEE Transactions on Geoscience and Remote Sensing: A Publication of the IEEE Geoscience and Remote Sensing Society, 57(4), 2043–2056. https://doi.org/10.1109/TGRS.2018.2870871
  • Long, J., Shelhamer, E., & Darrell, T. (2015, Jun). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Boston, USA (pp. 3431–3440).
  • Lu, K., Sun, Y., & Ong, S. (2018). Dual-resolution U-Net: Building extraction from aerial images. 24th International Conference on Pattern Recognition (ICPR), 489–494. https://doi.org/10.1109/ICPR.2018.8545190.
  • Meng, X., Yang, Y., Wang, L., Wang, T., Rui, L., & Zhang, C. (2022). Class-guided swin transformer for semantic segmentation of remote sensing imagery. IEEE Geoscience and Remote Sensing Letters, 19, 1–5. https://doi.org/10.1109/LGRS.2022.3215200
  • Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional net- works for biomedical image segmentation. In N. Navab, H. Joachim, M. W. William, & A. Frangi (Eds.), Proceedings of International Conference Medical Image Computing and Computer-Assisted Intervention (pp. 234–241). Springer.
  • Sharma, D., & Singhai, J. (2021). An unsupervised framework to extract the diverse building from the satellite images using Grab-cut method. Earth Sci Inform, 14(2), 777–795. https://doi.org/10.1007/s12145-021-00569-7
  • Simonyan, K., Zisserman, A., & Zisserman, A. Learning local feature descriptors using convex optimisation. (2014). IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(8), 1573–1585. arXiv preprint arXiv:1409.1556. https://doi.org/10.1109/TPAMI.2014.2301163
  • Wang, L., Fang, S., Meng, X., & Rui, L. (2022). Building extraction with vision transformer. IEEE Transactions on Geoscience and Remote Sensing, 60, 1–11. https://doi.org/10.1109/TGRS.2022.3186634
  • Xu, S., Pan, X., Li, E., Wu, B., Bu, S., Dong, W., Xiang, S., & Zhang, X. (2018, Dec). Automatic building rooftop extraction from aerial images via hierarchical RGB-D priors. IEEE Transactions on Geoscience and Remote Sensing, 56(12), 7369–7387. https://doi.org/10.1109/TGRS.2018.2850972
  • Ye, M., Ruiwen, N., Chang, Z., Gong, H., Tianli, H., Shijun, L., Sun, Y., Tong, Z., & Ying, G. (2021). A lightweight model of VGG-16 for remote sensing image classification. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 14, 6916–6922. https://doi.org/10.1109/JSTARS.2021.3090085
  • Yongtao, Y., Yinyin, L., Liu, C., Wang, J., Changhui, Y., Jiang, X., Wang, L., Liu, Z., & Zhang, Y. (2021). MarkCapsNet: Road marking extraction from aerial images using self-attention-guided capsule network. IEEE Geoscience and Remote Sensing Letters, 19, 1–5. https://doi.org/10.1109/LGRS.2021.3124575
  • Zhang, Q., & Guo, L. (2007). Self-enhanced SVM extraction of building objects from high resolution satellite images. Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007), 13. https://doi.org/10.1109/ICICIC.2007.511.
  • Zhang, Z., Liu, Q., & Wang, Y. (2018, May). Road extraction by deep residual U-Net. IEEE Geoscience and Remote Sensing Letters, 15(5), 749–753. https://doi.org/10.1109/LGRS.2018.2802944