Abstract | ||
---|---|---|
This paper presents a spectral transformation method for emotional speech synthesis based on voice conversion framework. Three emotions are studied, including anger, happiness and sadness. For the sake of high naturalness, superior speech quality and emotion expressiveness, our original STASC system is modified by introducing a new feature selection strategy and hierarchical codebook mapping procedure. Our result shows that the LSF coefficients at low frequency carry more emotion-relative information, and therefore only these coefficients are converted. Listening tests prove that the proposed method can achieve a satisfactory balance between emotional expression and speech quality of converted speech signals. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1007/11573548_48 | ACII |
Keywords | Field | DocType |
spectral transformation method,emotional expression,converted speech signal,emotion-relative information,superior speech quality,improved codebook mapping voice,lsf coefficient,speech quality,emotion expressiveness,emotional speech synthesis,speech synthesis,feature selection,low frequency | Sadness,Speech synthesis,Feature selection,Voice activity detection,Computer science,Naturalness,Active listening,Speech recognition,Emotional expression,Codebook | Conference |
Volume | ISSN | ISBN |
3784 | 0302-9743 | 3-540-29621-2 |
Citations | PageRank | References |
2 | 0.36 | 4 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yu-Ping Wang | 1 | 2 | 0.36 |
Zhen-Hua Ling | 2 | 850 | 83.08 |
Ren-Hua Wang | 3 | 344 | 41.36 |