Title
Dnn-Based Emotion Recognition Based On Bottleneck Acoustic Features And Lexical Features
Abstract
In this paper, we propose a novel emotion recognition method to reflect affect salient information using acoustic and lexical features. The acoustic features are extracted from the speech signal by applying statistical functionals of emotionally high-level features derived from Deep Neural Network (DNN). These acoustic features are early fused with two types of lexical features extracted from the text transcription of the speech signal, which are the distributed representation and affective lexicon-based dimensions. The fused features are fed to another DNN for utterance-level emotion classification. Experimental results on the Interactive Emotional Dyadic Motion Capture (IEMOCAP) multimodal dataset showed 75.5% in unweighted accuracy recall, which outperformed the best results reported previously in the multimodal emotion recognition using acoustic and lexical features.
Year
DOI
Venue
2019
10.1109/icassp.2019.8683077
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
Keywords
Field
DocType
Multimodal emotion recognition, DNN-based emotion recognition, Acoustic feature, Lexical feature
Motion capture,Bottleneck,Pattern recognition,Computer science,Emotion classification,Speech recognition,Lexicon,Artificial intelligence,Artificial neural network,Affect (psychology),Recall,Salient
Conference
ISSN
Citations 
PageRank 
1520-6149
0
0.34
References 
Authors
0
2
Name
Order
Citations
PageRank
Eesung Kim111.73
Jong Won Shin221521.85