Title
Speaker Recognition in an Emotionalized Spontaneous Speech Using Empirical Mode Decomposition
Abstract
Speaker recognition in emotionalized spontaneous speech is the key technique of human-centered computing. Existed speaker recognition systems are hard to work during real-time application, because the extracted features of voiceprint aren't consistent in each time. This is caused by the variety of emotionalized intonation and contaminated acoustic signals of speaker, This Is the problem we encountered when we tried to identify the speaker with the emotionalized spontaneous speech in real-time. To solve this problem, some inherent features from varying emotionatized spontaneous speech through different experiments are presented in the first part of this paper. Then the original speech signal is decomposed into several intrinsic mode functions based on the empirical mode decomposition. According to those intrinsic mode functions, the inherent features of the speaker's voiceprint are identified. Thus, a Gaussian mixture model can be constructed according to those features. However, the experiment based on our corpus of mandarin spontaneous speech showed one conclusion that the identifying rate is improved for fifteen percentage in average under our endeavors.
Year
Venue
Keywords
2007
Lecture Notes in Engineering and Computer Science
speaker recognition,spontaneous speech,empirical mode decomposition,intrinsic mode function
Field
DocType
ISSN
Computer science,Speech recognition,Speaker recognition,Speaker diarisation,Hilbert–Huang transform
Conference
2078-0958
Citations 
PageRank 
References 
0
0.34
1
Authors
3
Name
Order
Citations
PageRank
Fu-hua Chou161.72
Yu-shuo Liu200.34
Che Wun Chiou327221.81