Research Article

Less-supervised learning with knowledge distillation for sperm morphology analysis

Article: 2347978 | Received 02 May 2023, Accepted 17 Apr 2024, Published online: 08 May 2024

Figures & data

Figure 1. An example of the images in the MHSMA dataset. Both images show the same sperm: one is 128x128 pixels, and the other is a cropped 64x64-pixel version.

Figure 2. Images and labels in the MHSMA dataset for different parts of the sperm: vacuole, tail, head, and acrosome.

Table 1. Number of positive and negative samples in the MHSMA dataset. There are 1,540 sperm images in the dataset, labelled as normal or abnormal.

Figure 3. The anomaly detection (AD) procedure in our proposed method. This flowchart shows how we determine whether a raw image is normal or abnormal.

Figure 4. Visual summary of our method, an offline distillation approach. LSn denotes the nth layer of the student network and LTn the corresponding layer of the teacher network. The figure also indicates roughly which layers we use as critical layers.
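A minimal sketch of how a feature-level distillation loss over the critical layers could be computed is given below. The dictionary interface, the cosine-distance term, and the assumption of matching feature shapes are illustrative choices, not the paper's exact implementation.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_feats, teacher_feats, critical_layers):
    """Sum a feature-matching term over the chosen critical layers.

    student_feats / teacher_feats: dicts mapping layer index -> activation
    tensors of shape (batch, C, H, W); matching shapes are assumed here
    (e.g. via adapter layers). The cosine-distance term is illustrative.
    """
    loss = torch.tensor(0.0)
    for k in critical_layers:
        s = F.normalize(student_feats[k].flatten(1), dim=1)
        t = F.normalize(teacher_feats[k].flatten(1), dim=1)
        # 1 - cosine similarity: small when the student mimics the teacher
        loss = loss + (1.0 - (s * t).sum(dim=1)).mean()
    return loss
```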

Figure 5. Distance vectors for normal and abnormal samples. The figure shows the distances for several normal and abnormal sperm samples, on which we base anomaly detection. The graph is obtained from actual data in the dataset after training the model.
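A sketch of how per-layer distances like those plotted here could be turned into an anomaly decision; the Euclidean norm, the mean aggregation, and the threshold are illustrative assumptions.

```python
import torch

def distance_vector(student_feats, teacher_feats, critical_layers):
    """Per-layer distance between student and teacher activations for one image."""
    return torch.stack([
        torch.norm(student_feats[k] - teacher_feats[k])  # Euclidean norm (assumed metric)
        for k in critical_layers
    ])

def is_abnormal(dist_vec, threshold):
    """Flag a sperm image as abnormal when its aggregated distance exceeds a threshold.
    The mean aggregation and the threshold value are hypothetical."""
    return dist_vec.mean().item() > threshold
```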

Figure 6. The architecture of the VGG16 encoder (without the linear layers). This network is used as the teacher network in our method.
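A sketch of obtaining the VGG16 convolutional encoder (without the linear layers) with torchvision and freezing it as a teacher; the ImageNet-pretrained weights and the 3-channel 128x128 input are assumptions for illustration.

```python
import torch
from torchvision.models import vgg16, VGG16_Weights

# The .features module of VGG16 is the convolutional encoder (no linear layers).
teacher = vgg16(weights=VGG16_Weights.IMAGENET1K_V1).features.eval()
for p in teacher.parameters():
    p.requires_grad = False  # the teacher stays frozen during distillation

with torch.no_grad():
    feats = teacher(torch.randn(1, 3, 128, 128))  # a 3-channel 128x128 input (assumed)
```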

Figure 7. Model A: a simple network proposed to analyze the distillation effect. This network is used as a student network in our method.

Figure 8. Model B: the architecture of the proposed student network. This network is used as a student network in our method.

Figure 9. Example of a sperm image in the dataset with flip augmentation applied during the training phase.

Figure 10. Example of a sperm image in the dataset with rotate augmentation applied during the training phase.
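A sketch of applying the flip and rotate augmentations shown in Figures 9 and 10 during training; the probabilities and the rotation range are hypothetical, not the paper's exact settings.

```python
from torchvision import transforms

# Illustrative training-time augmentation; the probabilities and the rotation
# range are assumptions.
train_transform = transforms.Compose([
    transforms.RandomHorizontalFlip(p=0.5),
    transforms.RandomVerticalFlip(p=0.5),
    transforms.RandomRotation(degrees=180),
    transforms.ToTensor(),
])
```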

Figure 11. Gradient vector calculated from an image. This vector is used to add an adversarial attack to the image based on the gradient values.

Figure 12. The steps of creating an attacked image from a clean image. Adding the epsilon-scaled sign of the gradients to the image produces the attacked image.
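A sketch of the fast-gradient-sign step described here: the attacked image is the original image plus epsilon times the sign of the input gradient. The loss used to obtain the gradient and the [0, 1] pixel range are assumptions.

```python
import torch

def fgsm_attack(model, image, loss_fn, epsilon):
    """Build an attacked image as x_adv = x + epsilon * sign(dL/dx)."""
    image = image.clone().detach().requires_grad_(True)
    loss = loss_fn(model(image))      # the loss used for the gradient is an assumption
    loss.backward()
    attacked = image + epsilon * image.grad.sign()
    return attacked.clamp(0.0, 1.0).detach()  # keep pixel values in the valid range
```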

Table 2. Results of the proposed method for each part of the sperm. These results were obtained with the optimal setup of the proposed approach on the test set.

Table 3. Confusion matrix of the proposed method for each part of the sperm. These results come from the evaluation on the test set.

Table 4. Comparison of our results with those achieved by Ghasemian et al. (Citation2015) and Javadi and Mirroshandel (Citation2019) on each part of sperm evaluated on the test set of the MHSMA dataset.

Figure 13. Boxplot of the mean total loss for the two groups. The plot indicates that the means of the two groups differ.

Table 5. ANOVA test results for comparison of our results with those achieved by Javadi and Mirroshandel (Citation2019).

Figure 14. Anomaly localization map on abnormal samples from the test set. This map shows which parts of the image the model paid more attention to.
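A sketch of how such an anomaly localization map could be produced: upsample the per-location teacher-student feature disagreement to the input resolution. The squared-difference measure and bilinear upsampling are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def localization_map(student_feat, teacher_feat, out_size=(128, 128)):
    """Spatial map of teacher-student disagreement for one image.

    student_feat / teacher_feat: (1, C, h, w) activations from a critical layer
    (matching shapes assumed).
    """
    diff = (student_feat - teacher_feat).pow(2).mean(dim=1, keepdim=True)  # (1, 1, h, w)
    return F.interpolate(diff, size=out_size, mode="bilinear", align_corners=False)[0, 0]
```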

Table 6. Results achieved with different λ values on the validation set. The λ value balances the contributions of the multiple loss terms.
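A minimal sketch of how λ could weight the loss terms compared in this table; the convex-combination form and the placeholder names distill_loss and attack_loss are assumptions, not the paper's exact formulation.

```python
def weighted_total_loss(loss_a, loss_b, lam):
    """Convex combination of two loss terms controlled by lambda (form assumed)."""
    return lam * loss_a + (1.0 - lam) * loss_b

# e.g. total = weighted_total_loss(distill_loss, attack_loss, lam=0.7)
# distill_loss, attack_loss, and the value 0.7 are hypothetical placeholders.
```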

Figure 15. The results of using different architectures for the student network. Model A has a simple architecture, Model B has the proposed student architecture, and Model C has the same architecture as the teacher network.

Table 7. Number of trainable parameters in the three proposed models, illustrating the effect of distillation. Changing each network's architecture changes its number of trainable parameters.

Table 8. Effect of critical layers on the performance of the proposed method. These results show how changing the number of critical layers used in the loss calculation affects performance.

Table 9. Model performance metrics comparison with and without data augmentation and adversarial attack.

Appendix A. Table of abbreviations used in the text with their descriptions.