Abstract | ||
---|---|---|
CU VOCAL is a Cantonese text-to-speech (TTS) engine. We use a syllable-based concatenative synthesis approach to generate intelligible and natural synthesized speech (1). This paper describes several recent enhancements in CU VOCAL. First, we have augmented the syllable unit selection strategy with a positional feature. This feature specifies the relative location of a syllable in a sentence and serves to improve the quality of Cantonese tone realization. Second, we have developed the CU VOCAL SAPI engine, a version of the synthesizer that eases integration with applications using SAPI (Speech Application Programming Interface). We demonstrate the use of CU VOCAL SAPI in an electronic book (e-book) reader. Third, we have made an initial attempt to use the CU VOCAL SAPI engine in Web content authored with Speech Application Language Tags (SALT). The use of SALT tags can ease the task of invoking Cantonese TTS service on webpages. |
Year | Venue | Keywords |
---|---|---|
2003 | INTERSPEECH | general intelligence,application program interface,text to speech |
Field | DocType | Citations |
Concatenative synthesis,Speech Application Language Tags,Electronic book,Web page,Computer science,Speech recognition,Application programming interface,Syllable,Natural language processing,Artificial intelligence,Web content,Sentence | Conference | 2 |
PageRank | References | Authors |
0.58 | 4 | 8 |
Name | Order | Citations | PageRank |
---|---|---|---|
Helen M. Meng | 1 | 1078 | 172.82 |
Yuk-Chi Li | 2 | 67 | 6.30 |
Tien-Ying Fung | 3 | 24 | 5.09 |
Man-Cheuk Ho | 4 | 5 | 1.27 |
Chi-Kin Keung | 5 | 91 | 6.27 |
Tin-Hang Lo | 6 | 15 | 2.74 |
Wai-Kit Lo | 7 | 222 | 23.01 |
Pak-chung Ching | 8 | 1366 | 139.74 |