2,119
Views
2
CrossRef citations to date
0
Altmetric
Research Article

GAN-BElectra: Enhanced Multi-class Sentiment Analysis with Limited Labeled Data

&
Article: 2083794 | Received 05 Feb 2022, Accepted 25 May 2022, Published online: 26 Jun 2022

Figures & data

Figure 1. Example of basic lexicon-based sentiment analysis.

Figure 1. Example of basic lexicon-based sentiment analysis.

Figure 2. GAN-BERT Architecture (Croce, Castellucci, and Basili Citation2020). U, L, F, G, D denote unlabeled data, labeled data, fake labels, generator, and discriminator respectively.

Figure 2. GAN-BERT Architecture (Croce, Castellucci, and Basili Citation2020). U, L, F, G, D denote unlabeled data, labeled data, fake labels, generator, and discriminator respectively.

Figure 3. Electra-based pretrained component.

Figure 3. Electra-based pretrained component.

Figure 4. SG-Elect architecture (Riyadh and Shafiq Citation2021), U, L, C denote unlabeled data, labeled data, and combinator respectively.

Figure 4. SG-Elect architecture (Riyadh and Shafiq Citation2021), U, L, C denote unlabeled data, labeled data, and combinator respectively.

Figure 5. GAN-BElectra architecture. U and L denote unlabeled and labeled data respectively.

Figure 5. GAN-BElectra architecture. U and L denote unlabeled and labeled data respectively.

Algorithm I: Training process of GAN-BElectra based on SG-Elect (Riyadh and Shafiq Citation2021)

Figure 6. SST5 dataset composition.

Figure 6. SST5 dataset composition.

Figure 7. US Airline dataset composition.

Figure 7. US Airline dataset composition.

Figure 8. SemEval dataset composition.

Figure 8. SemEval dataset composition.

Table 1. Summary of Results (F1-Score, Accuracy, Standard Deviation).

Table 2. Summary of Results (Standard Error, Confidence Interval of Standard Error).

Table 3. GAN-BERT’s accuracy in pseudo label generation across three datasets.

Table 4. Detailed Results for the SST5 Dataset.

Table 5. Detailed Results for the US Airline Dataset.

Table 6. Detailed Results for the SemEval Dataset.

Figure 9. Confusion matrices for SST5 dataset for (a) GAN-BElectra (b) SG-Elect (c) Electra and (d) GAN-BERT.

Figure 9. Confusion matrices for SST5 dataset for (a) GAN-BElectra (b) SG-Elect (c) Electra and (d) GAN-BERT.

Figure 10. Confusion matrices (with red shades) for the US Airline dataset for (a) GAN-BElectra (b) SG-Elect (c) Electra and (d) GAN-BART and confusion matrices (with green shades) for the SemEval dataset (e) GAN-BElectra (f) SG-Elect (g) Electra and (h) GAN-BART.

Figure 10. Confusion matrices (with red shades) for the US Airline dataset for (a) GAN-BElectra (b) SG-Elect (c) Electra and (d) GAN-BART and confusion matrices (with green shades) for the SemEval dataset (e) GAN-BElectra (f) SG-Elect (g) Electra and (h) GAN-BART.