Abstract | ||
---|---|---|
This paper introduces the IBM Expressive Speech Synthesis system. We describe recent work in improving the quality of our baseline text-to-speech system as well as extending our capabilities to generate expressive synthetic speech. We present results showing improved base quality, especially for sentences drawn from a limited domain. We also demonstrate our ability to convey good news and bad news, produce contrastive emphasis, and ask a question appropriately. In order to facilitate access to the expressive capabilities, we use some of our proposed extensions to the Speech Synthesis Markup Language (SSML). |
Year | Venue | Keywords |
---|---|---|
2004 | INTERSPEECH | text to speech,speech synthesis |
Field | DocType | Citations |
Speech corpus,IBM,Speech synthesis,Ask price,Speech Synthesis Markup Language,Computer science,Chinese speech synthesis,Speech recognition,Natural language processing,Telegraphic speech,Artificial intelligence,Speech technology | Conference | 18 |
PageRank | References | Authors |
1.20 | 7 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
wael hamza | 1 | 198 | 15.84 |
Ellen Eide | 2 | 96 | 19.16 |
Raimo Bakis | 3 | 153 | 308.32 |
Michael Picheny | 4 | 1461 | 920.15 |
John F. Pitrelli | 5 | 493 | 81.16 |