GPU based building footprint identification utilising self-attention multiresolution analysis

Rizwan Ahmed Ansari, Department of Electronics and Telecommunication, Symbiosis Institute of Technology, Symbiosis International University, Pune, India. Correspondence: [email protected]

Akshat Ramachandran, Department of Electrical Engineering, Veermata Jijabai Technological Institute, Mumbai, India

Winnie Thomas, Department of Electrical Engineering, Indian Institute of Technology Bombay, Mumbai, India

Pages 102-111 | Received 09 Jan 2023, Accepted 11 Apr 2023, Published online: 27 Apr 2023

https://doi.org/10.1080/27669645.2023.2202961


References

Azimi, S. M., Fischer, P., Körner, M., & Reinartz, P. (2019, May). Aerial LaneNet: Lane-marking semantic segmentation in aerial imagery using wavelet-enhanced cost-sensitive symmetric fully convolutional neural networks. IEEE Transactions on Geoscience and Remote Sensing, 57(5), 2920–2938. https://doi.org/10.1109/TGRS.2018.2878510
Cao, R., Fang, L., Lu, T., & He, N. (2021, Jan). Self-attention-based deep feature fusion for remote sensing scene classification. IEEE Geoscience and Remote Sensing Letters, 18(1), 43–47. https://doi.org/10.1109/LGRS.2020.2968550
Chen, J., Jiang, Y., Luo, L., & Gong, W. (2022). ASF-Net: Adaptive screening feature network for building footprint extraction from remote-sensing images. IEEE Transactions on Geoscience and Remote Sensing, 60, 1–13. https://doi.org/10.1109/TGRS.2022.3165204
Dai, C. D., & Niessner, M. (2020). SG-NN: Sparse generative neural networks for self-supervised scene completion of RGB-D scans. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 846–855. https://doi.org/10.1109/CVPR42600.2020.00093
Fu, J., Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z., & Lu, H. (2019). Dual attention network for scene segmentation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 3141–3149. https://doi.org/10.1109/CVPR.2019.00326
GFDRR Labs. (2020). Open cities AI challenge dataset, version 1.0, 20 Dec. 2021. Radiant MLHub. https://doi.org/10.34911/rdnt.f94cxb
Ji, S., Wei, S., & Lu, M. (2019, Jan). Fully convolutional networks for multisource building extraction from an open aerial and satellite imagery data set. IEEE Transactions on Geoscience and Remote Sensing, 57(1), 574–586. https://doi.org/10.1109/TGRS.2018.2858817
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA.
Kang, J., Fernandez-Beltran, R., Sun, X., Jingen, N., & Plaza, A. (2021). Deep learning-based building footprint extraction with missing annotations. IEEE Geoscience and Remote Sensing Letters, 19, 1–5. https://doi.org/10.1109/LGRS.2021.3072589
Wang, L., Li, R., Zhang, C., Fang, S., Duan, C., Meng, X., & Atkinson, P. M. (2022). UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery. ISPRS Journal of Photogrammetry and Remote Sensing, 190, 196–214. https://doi.org/10.1016/j.isprsjprs.2022.06.008
Li, R., Duan, C., Zheng, S., Zhang, C., & Atkinson, P. M. (2021). MACU-Net for semantic segmentation of fine-resolution remotely sensed images. IEEE Geoscience and Remote Sensing Letters, 1, 1–5. https://doi.org/10.1109/LGRS.2021.3052886
Li Liu, Y., Yin, H., Li, Y., Guo, Q., Zhang, L., & Du, P. (2021). Attention residual U-Net for building segmentation in aerial images. IEEE International Geoscience and Remote Sensing Symposium IGARSS, 4047–4050. https://doi.org/10.1109/IGARSS47720.2021.9554058
Lin, T.-Y., Goyal, P., Girshick, R., He, K., & Dollár, P. (2017). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.
Li, H., Qiu, K., Chen, L., Mei, X., Hong, L., & Tao, C. (2021, May). SCAttNet: Semantic segmentation network with spatial and channel attention mechanism for high-resolution remote sensing images. IEEE Geoscience and Remote Sensing Letters, 18(5), 905–909. https://doi.org/10.1109/LGRS.2020.2988294
Liu, Y., Yao, J., Lu, X., Xia, M., Wang, X., & Liu, Y. (2019, Apr). RoadNet: Learning to comprehensively analyze road networks in complex urban scenes from high-resolution remotely sensed images. IEEE Transactions on Geoscience and Remote Sensing, 57(4), 2043–2056. https://doi.org/10.1109/TGRS.2018.2870871
Long, J., Shelhamer, E., & Darrell, T. (2015, Jun). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Boston, USA (pp. 3431–3440).
Lu, K., Sun, Y., & Ong, S. (2018). Dual-resolution U-Net: Building extraction from aerial images. 24th International Conference on Pattern Recognition (ICPR), 489–494. https://doi.org/10.1109/ICPR.2018.8545190
Meng, X., Yang, Y., Wang, L., Wang, T., Li, R., & Zhang, C. (2022). Class-guided Swin transformer for semantic segmentation of remote sensing imagery. IEEE Geoscience and Remote Sensing Letters, 19, 1–5. https://doi.org/10.1109/LGRS.2022.3215200
Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional networks for biomedical image segmentation. In N. Navab, J. Hornegger, W. M. Wells, & A. F. Frangi (Eds.), Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention (pp. 234–241). Springer.
Sharma, D., & Singhai, J. (2021). An unsupervised framework to extract the diverse building from the satellite images using Grab-cut method. Earth Science Informatics, 14(2), 777–795. https://doi.org/10.1007/s12145-021-00569-7
Simonyan, K., & Zisserman, A. (2014). Learning local feature descriptors using convex optimisation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(8), 1573–1585. arXiv preprint arXiv:1409.1556. https://doi.org/10.1109/TPAMI.2014.2301163
Wang, L., Fang, S., Meng, X., & Li, R. (2022). Building extraction with vision transformer. IEEE Transactions on Geoscience and Remote Sensing, 60, 1–11. https://doi.org/10.1109/TGRS.2022.3186634
Xu, S., Pan, X., Li, E., Wu, B., Bu, S., Dong, W., Xiang, S., & Zhang, X. (2018, Dec). Automatic building rooftop extraction from aerial images via hierarchical RGB-D priors. IEEE Transactions on Geoscience and Remote Sensing, 56(12), 7369–7387. https://doi.org/10.1109/TGRS.2018.2850972
Ye, M., Ruiwen, N., Chang, Z., Gong, H., Tianli, H., Shijun, L., Sun, Y., Tong, Z., & Ying, G. (2021). A lightweight model of VGG-16 for remote sensing image classification. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 14, 6916–6922. https://doi.org/10.1109/JSTARS.2021.3090085
Yongtao, Y., Yinyin, L., Liu, C., Wang, J., Changhui, Y., Jiang, X., Wang, L., Liu, Z., & Zhang, Y. (2021). MarkCapsNet: Road marking extraction from aerial images using self-attention-guided capsule network. IEEE Geoscience and Remote Sensing Letters, 19, 1–5. https://doi.org/10.1109/LGRS.2021.3124575
Zhang, Q., & Guo, L. (2007). Self-enhanced SVM extraction of building objects from high resolution satellite images. Second International Conference on Innovative Computing, Information and Control (ICICIC 2007), 13. https://doi.org/10.1109/ICICIC.2007.511
Zhang, Z., Liu, Q., & Wang, Y. (2018, May). Road extraction by deep residual U-Net. IEEE Geoscience and Remote Sensing Letters, 15(5), 749–753. https://doi.org/10.1109/LGRS.2018.2802944
