Abstract | ||
---|---|---|
In this paper, we present an audio-based emotion recognition model by using OpenSmile, Gaussian mixture models (GMMs) Supervector and Support vector machines (SVM) with Linear kernel. Features are extracted from audio characteristics of emotional video through OpenSmile into Mel-frequency Cepstral Coefficient (MFCC) of 39 dimensions for each video. Furthermore, these features are normalized to the same size using GMM Supervector with 32 mixture components. Finally, data is classified using SVM with Linear Kernel. To evaluate the model, this paper using the AFEW2017 dataset and SAVEE dataset and show comparable the results on the state-of-the-art network. The experimental results perform with 37% on AFEW and 73.5% on SAVEE dataset. Our proposed achieves improved emotion recognition from audio as compared to several other models. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1145/3184066.3184086 | 2ND INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING (ICMLSC 2018) |
Keywords | Field | DocType |
Audio-based Emotion Recognition, MFCC, Gaussian Mixture Model (GMM) Supervector, Support Vector Machines (SVM), OpenSmile toolkit | Kernel (linear algebra),Mel-frequency cepstrum,Normalization (statistics),Pattern recognition,Emotion recognition,Computer science,Cepstrum,Support vector machine,Artificial intelligence,Mixture model | Conference |
Citations | PageRank | References |
0 | 0.34 | 13 |
Authors | ||
7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Dinh-Son Tran | 1 | 0 | 0.34 |
Hyungjeong Yang | 2 | 455 | 47.05 |
Soo-Hyung Kim | 3 | 191 | 49.03 |
Gueesang Lee | 4 | 208 | 52.71 |
Luu Ngoc Do | 5 | 10 | 1.96 |
Ngoc-Huynh Ho | 6 | 10 | 3.51 |
Van Quan Nguyen | 7 | 4 | 1.78 |