Abstract | ||
---|---|---|
Text-to-Prosody systems based on the use of prosodic databases extracted from natural speech will be a key point for further development of new Text-to-Speech systems. This paper describes a system using such speech databases to generate the rhythm and the intonation of texts written in French. The system is based on a very crude chinks 'n chunks prosodic phrasing algorithm and on an automatic prosodic analysis of a natural speech database. The rhythm of the synthetic speech is generated with a CART tree trained on a large mono-speaker speech corpus. The acoustic aspect of the intonation is derived from a set of prosodic patterns automatically derived from the same speech corpus. At synthesis time, patterns are chosen on the fly from the database so as to minimize a total selection cost composed of pattern target costs and pattern concatenation costs. |
Year | Venue | Keywords |
---|---|---|
1998 | SSW | text to speech |
Field | DocType | Citations |
Prosody,Computer science,Speech recognition,Natural language processing,Artificial intelligence | Conference | 12 |
PageRank | References | Authors |
1.62 | 8 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
F. Malfrère | 1 | 32 | 3.10 |
T. Dutoit | 2 | 313 | 30.47 |
P. Mertens | 3 | 39 | 5.88 |