Abstract | ||
---|---|---|
Several speech enhancement approaches utilize trained models of clean speech data, such as codebooks, Gaussian mixtures, and hidden Markov models. These models are typically trained on neutral clean speech data, without any emotion. However, in practical scenarios, emotional speech is a common occurrence, which brings into question the suitability of using models trained on neutral speech for enhancement of noisy emotional speech. We investigate this problem using the example of a codebook-based speech enhancement approach, which utilizes trained codebooks of linear prediction parameters. Anger and happiness are used as examples of emotions. Our experiments demonstrate that employing emotion-dependent speech codebooks results in a significant benefit over using emotion-independent codebooks for enhancing emotional noisy speech. We also present results using a Bayesian framework employing both emotiondependent and independent speech codebooks that exhibits a robust behavior when the type of emotion is not known a priori. Index Terms ?? Speech enhancement, codebook, emotional speech |
Year | Venue | Field |
---|---|---|
2012 | IWAENC | Speech enhancement,Speech processing,Speech coding,Voice activity detection,Computer science,PSQM,Speech recognition,Hidden Markov model,Linear predictive coding,Codebook |
DocType | ISBN | Citations |
Conference | 978-3-8007-3451-1 | 0 |
PageRank | References | Authors |
0.34 | 0 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
D. Hanumantha Rao Naidu | 1 | 0 | 0.34 |
Sriram Srinivasan | 2 | 379 | 27.92 |