Title
Using Neutralized Formant Frequencies To Improve Emotional Speech Recognition
Abstract
Emotion of speech degrades the performance of Automatic Speech Recognition (ASR) systems. With the aim of enhancing the emotional speech recognition accuracy, the effects of formant frequencies and their slopes on improving the performance are investigated in this paper. For this purpose, the formant frequencies are neutralized using hybrid of Dynamic Time Warping (DTW) and Multi-Layer Perceptron (MLP) neural networks. Each one of the neutralized formant frequencies is considered as a supplementary feature and used in Hidden Markov Model (HMM)-based ASR. Experimental results show that by using the slope of neutralized formant frequency features, the recognition rate in happiness and anger states is improved by at most 2.1% and 3.6%, respectively.
Year
DOI
Venue
2011
10.1587/elex.8.1155
IEICE ELECTRONICS EXPRESS
Keywords
Field
DocType
emotional speech recognition, formant neutralization, DTW, MLP, HMM
Computer science,Speech recognition,Formant,Hidden Markov model
Journal
Volume
Issue
ISSN
8
14
1349-2543
Citations 
PageRank 
References 
0
0.34
2
Authors
3
Name
Order
Citations
PageRank
Davood Gharavian111710.06
Mansour Sheikhan229720.38
Farhad Ashoftedel3201.23