LIGHT-SERNET: A Lightweight Fully Convolutional Neural Network for Speech Emotion Recognition - Citegraph

Paper Info

Title
LIGHT-SERNET: A Lightweight Fully Convolutional Neural Network for Speech Emotion Recognition

Abstract
Detecting emotions directly from a speech signal plays an important role in effective human-computer interactions. Existing speech emotion recognition models require massive computational and storage resources, making them hard to implement concurrently with other machine-interactive tasks in embedded systems. In this paper, we propose an efficient and lightweight fully convolutional neural network for speech emotion recognition in systems with limited hardware resources. In the proposed FCNN model, various feature maps are extracted via three parallel paths with different filter sizes. This helps deep convolution blocks to extract high-level features, while ensuring sufficient separability. The extracted features are used to classify the emotion of the input speech segment. While our model has a smaller size than that of the state-of-the-art models, it achieves higher performance on the IEMOCAP and EMO-DB datasets.
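
As a rough illustration of the "three parallel paths with different filter sizes" idea in the abstract, the following is a minimal sketch in plain Python. It is not the authors' implementation. The kernel shapes, the fixed averaging weights, the toy input, and the helper names (`conv2d_same`, `averaging_kernel`, `parallel_paths`) are all assumptions made for illustration. A real model would learn its filter weights and follow the parallel paths with further convolution blocks and a classifier.

```python
# Illustrative sketch only: kernel shapes and weights are assumptions,
# not values taken from the paper.

def conv2d_same(x, kernel):
    """Naive 2D cross-correlation with zero padding; output has the shape of x."""
    rows, cols = len(x), len(x[0])
    k_rows, k_cols = len(kernel), len(kernel[0])
    pad_r, pad_c = k_rows // 2, k_cols // 2
    out = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            acc = 0.0
            for a in range(k_rows):
                for b in range(k_cols):
                    r, c = i + a - pad_r, j + b - pad_c
                    # Positions outside the input count as zeros.
                    if 0 <= r < rows and 0 <= c < cols:
                        acc += x[r][c] * kernel[a][b]
            out[i][j] = acc
    return out


def averaging_kernel(k_rows, k_cols):
    """Hypothetical fixed kernel; a trained model would learn these weights."""
    value = 1.0 / (k_rows * k_cols)
    return [[value] * k_cols for _ in range(k_rows)]


def parallel_paths(x, kernel_shapes):
    """Apply one convolution per path and stack the results as channels."""
    return [conv2d_same(x, averaging_kernel(r, c)) for r, c in kernel_shapes]


# Toy "spectrogram": 6 frequency bins by 8 time frames.
features = [[float(f + t) for t in range(8)] for f in range(6)]

# Three paths with different (assumed) filter sizes.
KERNEL_SHAPES = [(3, 1), (1, 3), (3, 3)]
feature_maps = parallel_paths(features, KERNEL_SHAPES)

# One feature map per path, each the same size as the input.
print(len(feature_maps), len(feature_maps[0]), len(feature_maps[0][0]))  # prints: 3 6 8
```

Each path sees the same input through a differently shaped window, so the stacked maps capture context along frequency, along time, and across both at once.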

Year	2022
DOI	10.1109/ICASSP43922.2022.9746679
Venue	IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
DocType	Conference
Citations	0
PageRank	0.34
References	0
Authors	4

Authors (4 rows)

Name	Order	Citations	PageRank
Arya Aftab	1	0	0.34
Alireza Morsali	2	2	4.08
Shahrokh Ghaemmaghami	3	0	0.34
Benoît Champagne	4	510	67.66
