Search in:

Advanced search

Connection Science Volume 35, 2023 - Issue 1

Submit an article Journal homepage

Open access

455

Views

CrossRef citations to date

Altmetric

Research Article

Dual conditional GAN based on external attention for semantic image synthesis

Gang LiuSchool of Computer Science, Hubei University of Technology, Wuhan, People’s Republic of China

https://orcid.org/0000-0002-8589-0457 View further author information

Qijun ZhouSchool of Computer Science, Hubei University of Technology, Wuhan, People’s Republic of ChinaCorrespondence[email protected]

https://orcid.org/0009-0006-0233-6378 View further author information

Xiaoxiao XieSchool of Computer Science, Hubei University of Technology, Wuhan, People’s Republic of China

https://orcid.org/0009-0003-6132-077X View further author information

Qingchen YuSchool of Computer Science, Hubei University of Technology, Wuhan, People’s Republic of China

https://orcid.org/0009-0009-4613-7874 View further author information

Article: 2259120 | Received 03 Jun 2023, Accepted 10 Sep 2023, Published online: 04 Oct 2023

Cite this article
https://doi.org/10.1080/09540091.2023.2259120
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

References

Brock, A., Donahue, J., & Simonyan, K. (2019). Large scale GaN training for high fidelity natural image synthesis. In 7th International Conference on Learning Representations, ICLR 2019. 7th International Conference on Learning Representations, ICLR 2019, May 6, 2019 to May 9, 2019, New Orleans, LA, USA.
Google Scholar
Chen, D., Hua, G., Liao, J., Yuan, L., Chai, M., He, M., Yu, N., Chu, Q., & Tan, Z. (2020). Efficient semantic image synthesis via class-adaptive normalization.
Google Scholar
Chen, Q., & Koltun, V. (2017). Photographic image synthesis with cascaded refinement networks. In Proceedings of the IEEE International Conference on Computer Vision. 16th IEEE International Conference on Computer Vision, ICCV 2017, October 22, 2017 to October 29, 2017, Venice, Italy.
Google Scholar
Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R., Franke, U., Roth, S., & Schiele, B. (2016). The Cityscapes dataset for semantic urban scene understanding. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, June 26, 2016 to July 1, 2016, Las Vegas, NV, USA.
Google Scholar
Han, K., Wang, Y., Guo, J., Tang, Y., & Wu, E. (2022). Vision GNN: An image is worth graph of nodes. In Advances in neural information processing systems. 36th Conference on Neural Information Processing Systems, NeurIPS 2022, November 28, 2022 to December 9, 2022, New Orleans, LA, USA.
Google Scholar
Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B., & Hochreiter, S. (2017). GANs trained by a two time-scale update rule converge to a local Nash equilibrium. In Advances in neural information processing systems. 31st Annual Conference on Neural Information Processing Systems, NIPS 2017, December 4, 2017 to December 9, 2017, Long Beach, CA, USA.
Google Scholar
Isola, P., Zhu, J.-Y., Zhou, T., & Efros, A. A. (2017). Image-to-image translation with conditional adversarial networks. In Proceedings – 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017. 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, July 21, 2017 to July 26, 2017, Honolulu, HI, USA.
Google Scholar
Karras, T., Laine, S., & Aila, T. (2019). A style-based generator architecture for generative adversarial networks. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019, June 16, 2019 to June 20, 2019, Long Beach, CA, USA.
Google Scholar
Karras, T., Laine, S., Aittala, M., Hellsten, J., & Aila, T. (2020). Analyzing and improving the image quality of StyleGAN. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
Google Scholar
Kim, J., Kim, M., Kang, H., & Lee, K. (2019). U-GAT-IT: Unsupervised generative attentional networks with adaptive layer-instance normalization for image-to-image translation.
Google Scholar
Lee, C.-H., Liu, Z., Wu, L., & Luo, P. (2020). MaskGAN: Towards diverse and interactive facial image manipulation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, June 14, 2020 to June 19, 2020, Virtual, Online, USA.
Google Scholar
Liu, X., Shao, J., Yin, G., Wang, X., & Li, H. (2019). Learning to predict layout-to-image conditional convolutions for semantic image synthesis. In Advances in neural information processing systems. 33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019, December 8, 2019 to December 14, 2019, Vancouver, BC, Canada.
Google Scholar
Lv, Z., Li, X., Niu, Z., Cao, B., & Zuo, W. (2022). Semantic-shape adaptive feature modulation for semantic image synthesis. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
Google Scholar
Mirza, M., & Osindero, S. (2014). Conditional generative adversarial nets. abs/1411.1784, undefined.
Google Scholar
Nichol, A., Dhariwal, P., Ramesh, A., Shyam, P., Mishkin, P., McGrew, B., Sutskever, I., & Chen, M. (2021). GLIDE: Towards photorealistic image generation and editing with text-guided diffusion models. arXiv.
Google Scholar
Ntavelis, E., Romero, A., Kastanis, I., Van Gool, L., & Timofte, R. (2020). SESAME: Semantic editing of scenes by adding, manipulating or erasing objects. In Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). 16th European Conference on Computer Vision, ECCV 2020, August 23, 2020 to August 28, 2020, Glasgow, UK.
Google Scholar
Park, T., Liu, M.-Y., Wang, T.-C., & Zhu, J.-Y. (2019, June 16–20). Semantic image synthesis with spatially-adaptive normalization. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA.
Google Scholar
Qi, X., Chen, Q., Jia, J., & Koltun, V. (2018). Semi-parametric image synthesis. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 31st Meeting of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018, June 18, 2018 to June 22, 2018, Salt Lake City, UT, USA.
Google Scholar
Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. arXiv.
Google Scholar
Ronneberger, O., Fischer, P., & Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. In Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). 18th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2015, October 5, 2015 to October 9, 2015, Munich, Germany.
Google Scholar
Schonfeld, E., Sushko, V., Zhang, D., Gall, J., Schiele, B., & Khoreva, A. (2021). You only need adversarial supervision for semantic image synthesis. In ICLR 2021 – 9th International Conference on Learning Representations. 9th International Conference on Learning Representations, ICLR 2021, May 3, 2021 to May 7, 2021, Virtual, Online.
Google Scholar
Tang, H., Bai, S., & Sebe, N. (2020). Dual attention gans for semantic image synthesis. In Proceedings of the 28th ACM International Conference on Multimedia.
Google Scholar
Tang, H., Qi, X., Sun, G., Xu, D., Sebe, N., Timofte, R., & Van Gool, L. (2020). Edge guided gans with contrastive learning for semantic image synthesis. arXiv. https://doi.org/10.48550/arXiv.2003.13898
Google Scholar
Tang, H., Xu, D., Sebe, N., Wang, Y., Corso, J. J., & Yan, Y. (2019). Multi-channel attention selection gan with cascaded semantic guidance for cross-view image translation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
Google Scholar
Tang, H., Xu, D., Yan, Y., Torr, P. H. S., & Sebe, N. (2020). Local class-specific and global image-level generative adversarial networks for semantic-guided scene generation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, June 14, 2020 to June 19, 2020, Virtual, Online, USA.
Google Scholar
Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., & Hu, Q. (2020). ECA-net: Efficient channel attention for deep convolutional neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
Google Scholar
Wang, T.-C., Liu, M.-Y., Zhu, J.-Y., Tao, A., Kautz, J., & Catanzaro, B. (2018). High-resolution image synthesis and semantic manipulation with conditional GANs. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 31st Meeting of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018, June 18, 2018 to June 22, 2018, Salt Lake City, UT, USA.
Google Scholar
Wang, W., Bao, J., Zhou, W., Chen, D., Chen, D., Yuan, L., & Li, H. (2022). Semantic image synthesis via diffusion models. arXiv. https://doi.org/10.48550/arXiv.2207.00050
Google Scholar
Wang, Y., Qi, L., Chen, Y.-C., Zhang, X., & Jia, J. (2021). Image synthesis via semantic composition. In Proceedings of the IEEE International Conference on Computer Vision. 18th IEEE/CVF International Conference on Computer Vision, ICCV 2021, October 11, 2021 to October 17, 2021, Virtual, Online, Canada.
Google Scholar
Xiao, T., Liu, Y., Zhou, B., Jiang, Y., & Sun, J. (2018). Unified perceptual parsing for scene understanding. In Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). 15th European Conference on Computer Vision, ECCV 2018, September 8, 2018 to September 14, 2018, Munich, Germany.
Google Scholar
Yu, F., Koltun, V., & Funkhouser, T. (2017). Dilated residual networks. In Proceedings – 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017. 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, July 21, 2017 to July 26, 2017, Honolulu, HI, USA.
Google Scholar
Zhu, P., Abdal, R., Qin, Y., & Wonka, P. (2020). SEAN: Image synthesis with semantic region-adaptive normalization. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, June 14, 2020 to June 19, 2020, Virtual, Online, USA.
Google Scholar

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Dual conditional GAN based on external attention for semantic image synthesis

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

Dual conditional GAN based on external attention for semantic image synthesis

References

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date