Speaker Recognition in an Emotionalized Spontaneous Speech Using Empirical Mode Decomposition - Citegraph

Paper Info

Title
Speaker Recognition in an Emotionalized Spontaneous Speech Using Empirical Mode Decomposition

Abstract
Speaker recognition in emotionalized spontaneous speech is the key technique of human-centered computing. Existed speaker recognition systems are hard to work during real-time application, because the extracted features of voiceprint aren't consistent in each time. This is caused by the variety of emotionalized intonation and contaminated acoustic signals of speaker, This Is the problem we encountered when we tried to identify the speaker with the emotionalized spontaneous speech in real-time. To solve this problem, some inherent features from varying emotionatized spontaneous speech through different experiments are presented in the first part of this paper. Then the original speech signal is decomposed into several intrinsic mode functions based on the empirical mode decomposition. According to those intrinsic mode functions, the inherent features of the speaker's voiceprint are identified. Thus, a Gaussian mixture model can be constructed according to those features. However, the experiment based on our corpus of mandarin spontaneous speech showed one conclusion that the identifying rate is improved for fifteen percentage in average under our endeavors.

Year	Venue	Keywords
2007	Lecture Notes in Engineering and Computer Science	speaker recognition,spontaneous speech,empirical mode decomposition,intrinsic mode function
Field	DocType	ISSN
Computer science,Speech recognition,Speaker recognition,Speaker diarisation,Hilbert–Huang transform	Conference	2078-0958
Citations	PageRank	References
0	0.34	1
Authors
3

Authors (3 rows)

Cited by (0 rows)

References (1 rows)

Name	Order	Citations	PageRank
Fu-hua Chou	1	6	1.72
Yu-shuo Liu	2	0	0.34
Che Wun Chiou	3	272	21.81

1