Title | ||
---|---|---|
Scalable Concatenative Speech Synthesis Based On The Plural Unit Selection And Fusion Method |
Abstract | ||
---|---|---|
Recently, concatenative speech synthesizers with large databases have been widely developed for high-quality speech synthesis. However, some platforms require a speech synthesis system that can work under the limitation of memory footprint or computational cost. In this paper, we propose a scalable concatenative speech synthesizer based on the plural speech unit selection and fusion method. To realize scalability, we propose the offline unit fusion method in which pitch-cycle waveforms for voiced segments are fused in advance. The experimental results show that the synthetic speech of the offline unit fusion method with half-size waveform database is comparable to that of the online unit fusion method, while the computation cost is reduced to 1/10. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1109/ICASSP.2005.1415125 | 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING |
Keywords | Field | DocType |
natural languages,cost function,sensor fusion,speech synthesis,scalability | Speech processing,Speech synthesis,Pattern recognition,Computer science,Voice activity detection,Waveform,Sensor fusion,Speech recognition,Artificial intelligence,Memory footprint,Scalability,Computation | Conference |
ISSN | Citations | PageRank |
1520-6149 | 2 | 0.70 |
References | Authors | |
5 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Masatsune Tamura | 1 | 107 | 15.26 |
Tatsuya Mizutani | 2 | 13 | 2.79 |
Takehiko Kagoshima | 3 | 42 | 8.66 |