1,623
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Multi-branch feature learning based speech emotion recognition using SCAR-NET

, , , , &
Article: 2189217 | Received 25 Dec 2022, Accepted 04 Mar 2023, Published online: 27 Apr 2023

Figures & data

Figure 1. The overall structure of SCAR-NET.

Figure 1. The overall structure of SCAR-NET.

Figure 2. The MFCC spectrum and the receptive fields of the parallel paths.

Figure 2. The MFCC spectrum and the receptive fields of the parallel paths.

Figure 3. The residual block and the SCAR block.

Figure 3. The residual block and the SCAR block.

Figure 4. The working principle of GAP.

Figure 4. The working principle of GAP.

Table 1. The detailed descriptions of EMO-DB, SAVEE, and RAVDESS datasets.

Table 2. The classification report of SCAR-NET evaluated on datasets.

Figure 5. The confusion matrix of SCAR-NET evaluated on EMO-DB.

Figure 5. The confusion matrix of SCAR-NET evaluated on EMO-DB.

Figure 6. The confusion matrix of SCAR-NET evaluated on SAVEE.

Figure 6. The confusion matrix of SCAR-NET evaluated on SAVEE.

Figure 7. The confusion matrix of SCAR-NET evaluated on RAVDESS.

Figure 7. The confusion matrix of SCAR-NET evaluated on RAVDESS.

Table 3. The effect of parallel paths on experimental evaluation.

Table 4. The impact of SCA structure with diverse branch numbers on experimental evaluation.

Table 5. The impact of diverse SCAR blocks numbers on experimental evaluation.

Table 6. Comparative analysis of SCAR-NET with recent SER methods.