Abstract | ||
---|---|---|
Modulation of speaking tone in frequency can make speech interesting and convey subtle meaning in communication. We present a frequency modulation (FM) technique for prosodic modification to consider communicative speech synthesis. This technique provides a mathematical formulation for representing speaking tone and manipulating FM in a unified framework. Two experiments are conducted with a text-to-speech system to which a module of FM-based prosodic modification is added. One is to enhance emphasis in words when synthesizing Chinese conversational speech. The other is to modify reading-style prosody while conveying good and bad news in Japanese; this is done by using the FM technique to shift the frequency ranges and rescale the fundamental frequency contours jointly. The experimental results indicated that the native speakers identified 90% of samples with emphases and 78% of "good news" as well as 94% of "bad news" samples. The FM technique is vital for making synthetic speech communicative. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1109/CHINSL.2008.ECP.41 | 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS |
Keywords | Field | DocType |
frequency modulation, prosodic modification, intonation, speech synthesis | Prosody,Speech synthesis,Fundamental frequency,Computer science,Modulation,Speech recognition,Frequency modulation,Hidden Markov model,Text processing | Conference |
Citations | PageRank | References |
3 | 0.57 | 6 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jinfu Ni | 1 | 88 | 16.32 |
Shinsuke Sakai | 2 | 126 | 23.52 |
Tohru Shimizu | 3 | 57 | 12.85 |
Satoshi Nakamura | 4 | 1099 | 194.59 |