Abstract | ||
---|---|---|
Text-to-speech synthesis research has moved away from building general purpose systems based on an understanding of human language and speech production towards building sys- tems based on statistical algorithms applied to large text and speech corpora, and, recently, towards building such systems for specific domains. Despite substantial progress, the overall qual- ity of even the best systems is often still inadequate for broad user acceptance in applications that cannot also be handled with simple phrase splicing. This tutorial paper analyzes which prob- lems must be addressed to achieve the goal of generating natural- sounding speech in limited domains in a cost-effective way, and the roles of data and rules as we work towards solutions. |
Year | Venue | Keywords |
---|---|---|
2000 | INTERSPEECH | cost effectiveness,speech production |
Field | DocType | Citations |
Speech analytics,General purpose,Computer science,Phrase,Speech recognition,Natural language processing,Artificial intelligence,Speech production,Speech technology | Conference | 0 |
PageRank | References | Authors |
0.34 | 15 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jan P. H. van Santen | 1 | 514 | 99.66 |
Michael W. Macon | 2 | 334 | 25.79 |
Andrew Cronk | 3 | 7 | 2.62 |
John-Paul Hosom | 4 | 231 | 23.43 |
Alexander Kain | 5 | 377 | 32.39 |
Vincent Pagel | 6 | 174 | 24.60 |
Johan Wouters | 7 | 0 | 0.34 |