Search in:

International Journal of Digital Earth Volume 17, 2024 - Issue 1

Submit an article Journal homepage

Open access

270

Views

CrossRef citations to date

Altmetric

Research Article

An unsupervised semantic segmentation method that combines the ImSE-Net model with SLICm superpixel optimization

Zenan Yanga School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo, People’s Republic of ChinaView further author information

Haipeng Niua School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo, People’s Republic of ChinaCorrespondence[email protected]
View further author information

Xiaoxuan Wangb Key Laboratory of Spatio-temporal Information and Ecological Restoration of Mines of Natural Resources of the People’s Republic of China, Henan Polytechnic University, Jiaozuo, People’s Republic of ChinaView further author information

Liang Huangc Faculty of Land Resources Engineering, Kunming University of Science and Technology, Kunming, People’s Republic of ChinaView further author information

Kui Yanga School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo, People’s Republic of ChinaView further author information

Article: 2341970 | Received 21 Jul 2023, Accepted 07 Apr 2024, Published online: 16 Apr 2024

Cite this article
https://doi.org/10.1080/17538947.2024.2341970
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Figures & data

Figure 1. Illustration of the workflow for the proposed model. The pre-segmentation phase integrates the ImSE-Net model with SLICm superpixel optimization, generating a preliminary semantic segmentation result denoted by the red dashed line above. The UGLS algorithm for further refinement classification results is indicated by the lower green dashed frame. (a) Original image; (b) segmentation result using ImSE-Net; (c) contour refinement based on SLICm; (d) segmentation result combining ImSE-Net and SLICm; (e) final segmentation result.

Figure 2. Illustration of the workflow for the proposed ImSE-Net method. FCN-32 uses ResNet-101 as the backbone, in combination with the proposed ImSE-Net model from Conv2_x to Conv5_x. This method groups pixels from the downsampling layer to create a allocation matrix assigning predicted target values to each pixel group. Unlike existing neural network methods, this method does not explicitly employ superpixel segmentation for downsampling. Hence, the method can be integrated into existing architectures without altering their feedforward paths.

Table 1. Detailed process for the ImSE-Net.

Display Table

Table 2. Detailed process for superpixel refinement.

Display Table

Figure 3. Illustration of the global hidden pseudo-positives and local hidden pseudo-positive selection process. GHPsp: unlabeled image samples $x_{i}^{'}$ ; feature extractor F; task-agnostic reference pool $Q^{ag}$ ; anchor features $f_{i}$ ; the segmentation head S produces corresponding segmentation features $s_{i} = S (f_{i})$ ; an index set of $P_{i}^{ag}$ for each i-th anchor feature $f_{i}$ ; the momentum segmentation head $S^{'}$ produces corresponding segmentation features $s_{i}^{'} = S^{'} (f_{i})$ ; task-specific reference pool $Q^{sg}$ ; an index set of $P_{i}^{sp}$ by comparing $s_{i}^{'}$ and $Q^{sg}$ ; the projection head Z produces a projection anchor vector $z_{i}$ ; GHPsP contrastive loss for i-th patch in unsupervised semantic segmentation with multiple positives $L_{ag}^{cont}$ and $L_{sp}^{cont}$ ; LHPsp: an index set $M_{i}^{nei}$ that contains the i-th anchor and its neighboring patches; it uses the average attention score value of ${\tilde{T}}_{i}$ as the threshold for selecting LHPsP $M_{i}^{loc}$ among $M_{i}^{nei}$ and then takes the above-average portion as the final experiment selection; the calculated gradient (G) for mixed feature $s_{i}^{mix}$ combines the neighboring positive features $G_{i}$ with the corresponding attention scores $T_{i}$ proportionally; the LHPsP objective functions $Z (s_{i}^{mix})$ produces a projected mixed vector $z_{i}^{mix}$ ; the global cost is Lc, and the local loss is L_r.

Figure 4. Example results with the COCO-stuff test set.

Figure 5. Example results on the build test set.

Figure 6. Example results on the mixed region test set.

Table 8. Per-class result on the COCO-stuff test set.

Download CSV Display Table

Table 9. Evaluation scores (%) of different baseline methods based on the COCO-stuff test set.

Download CSV Display Table

Table 10. Comparison between baseline and state-of-the-art methods for experiment 2. The best values are highlighted in bold.

Download CSV Display Table

Table 11. Per-class result for experiment 3.

Download CSV Display Table

Table 12. Comparison between baseline and state-of-the-art methods for experiment 3.

Download CSV Display Table

Table 13. Experimental results with various backbone network combinations.

Download CSV Display Table

Table 14. Evaluation results of the hierarchical levels.

Download CSV Display Table

Table 15. Influence of main components on semantic segmentation results.

Download CSV Display Table

Table 16. Influence of different stages on semantic segmentation results.

Download CSV Display Table

Pedregosa, Fabian, Gael Varoquaux, Alexandre Gramfort, Vincent Michel, and Bertrand Thirion. 2011. “Scikit-Learn: Machine Learning in Python.” Journal of Machine Learning Research 12 (2011): 2825–2830. https://doi.org/10.48550/arXiv.1201.0490.

Google Scholar

Achanta, Radhakrishna, Appu Shaji, Kevin Smith, Aurelie Lucchi, and Pascal Fua. 2012. “SLIC Superpixels Compared to State-of-the-Art Superpixel Methods.” IEEE Transactions on Pattern Analysis and Machine Intelligence 34 (11): 2274–2281. https://doi.org/10.1109/TPAMI.2012.120.

PubMed Web of Science ®Google Scholar

Lei, Tao, Peng Liu, Xiaohong Jia, Xuande Zhang, Hongying Meng, and Asoke K. Nandi. 2020. “Automatic Fuzzy Clustering Framework for Image Segmentation.” IEEE Transactions on Fuzzy Systems 28 (9): 2078–2092. https://doi.org/10.1109/TFUZZ.2019.2930030.

Web of Science ®Google Scholar

Jia, Xiaohong, Tao Lei, Peng Liu, Dinghua Xue, Hongying Meng, and Asoke K. Nandi. 2020. “Fast and Automatic Image Segmentation Using Superpixel-Based Graph Clustering.” IEEE Access 8: 211526–211539. https://doi.org/10.1109/ACCESS.2020.3039742.

Web of Science ®Google Scholar

Yang, Zenan, Haipeng Niu, Liang Huang, Xiaoxuan Wang, and Liangxin Fan. 2022. “Automatic Segmentation Algorithm for High-Spatial-Resolution Remote Sensing Images Based on Self-Learning Super-Pixel Convolutional Network.” International Journal of Digital Earth 15 (1): 1101–1124. https://doi.org/10.1080/17538947.2022.2083247.

Web of Science ®Google Scholar

Cho, Jang Hyun, Utkarsh Mall, Kavita Bala, and Bharath Hariharan. 2021. “PiCIE: Unsupervised Semantic Segmentation Using Invariance and Equivariance in Clustering.” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 16794–16804.

Google Scholar

Hamilton, Mark, Zhoutong Zhang, Bharath Hariharan, Noah Snavely, and William T. Freeman. 2022. “Unsupervised Semantic Segmentation by Distilling Feature Correspondences.” Proceedings of the International Conference on Learning Representations.

Google Scholar

Data availability statement

The code used in this study are available by contacting the corresponding author.

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

An unsupervised semantic segmentation method that combines the ImSE-Net model with SLICm superpixel optimization

Table 1. Detailed process for the ImSE-Net.

Table 2. Detailed process for superpixel refinement.

Table 3. Detailed process for unsupervised semantic segmentation UGLS.

Table 4. Grouping information about datasets.

Table 5. Test set task categories.

Table 6. Quantitative evaluation indicators.

Table 7. Description of the unsupervised semantic segmentation algorithms.

Table 8. Per-class result on the COCO-stuff test set.

Table 9. Evaluation scores (%) of different baseline methods based on the COCO-stuff test set.

Table 10. Comparison between baseline and state-of-the-art methods for experiment 2. The best values are highlighted in bold.

Table 11. Per-class result for experiment 3.

Table 12. Comparison between baseline and state-of-the-art methods for experiment 3.

Table 13. Experimental results with various backbone network combinations.

Table 14. Evaluation results of the hierarchical levels.

Table 15. Influence of main components on semantic segmentation results.

Table 16. Influence of different stages on semantic segmentation results.

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

An unsupervised semantic segmentation method that combines the ImSE-Net model with SLICm superpixel optimization

Figures & data

Table 1. Detailed process for the ImSE-Net.

Table 2. Detailed process for superpixel refinement.

Table 3. Detailed process for unsupervised semantic segmentation UGLS.

Table 4. Grouping information about datasets.

Table 5. Test set task categories.

Table 6. Quantitative evaluation indicators.

Table 7. Description of the unsupervised semantic segmentation algorithms.

Table 8. Per-class result on the COCO-stuff test set.

Table 9. Evaluation scores (%) of different baseline methods based on the COCO-stuff test set.

Table 10. Comparison between baseline and state-of-the-art methods for experiment 2. The best values are highlighted in bold.

Table 11. Per-class result for experiment 3.

Table 12. Comparison between baseline and state-of-the-art methods for experiment 3.

Table 13. Experimental results with various backbone network combinations.

Table 14. Evaluation results of the hierarchical levels.

Table 15. Influence of main components on semantic segmentation results.

Table 16. Influence of different stages on semantic segmentation results.

Data availability statement

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date