Title
Scalable Concatenative Speech Synthesis Based On The Plural Unit Selection And Fusion Method
Abstract
Recently, concatenative speech synthesizers with large databases have been widely developed for high-quality speech synthesis. However, some platforms require a speech synthesis system that can work under the limitation of memory footprint or computational cost. In this paper, we propose a scalable concatenative speech synthesizer based on the plural speech unit selection and fusion method. To realize scalability, we propose the offline unit fusion method in which pitch-cycle waveforms for voiced segments are fused in advance. The experimental results show that the synthetic speech of the offline unit fusion method with half-size waveform database is comparable to that of the online unit fusion method, while the computation cost is reduced to 1/10.
Year
DOI
Venue
2005
10.1109/ICASSP.2005.1415125
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING
Keywords
Field
DocType
natural languages,cost function,sensor fusion,speech synthesis,scalability
Speech processing,Speech synthesis,Pattern recognition,Computer science,Voice activity detection,Waveform,Sensor fusion,Speech recognition,Artificial intelligence,Memory footprint,Scalability,Computation
Conference
ISSN
Citations 
PageRank 
1520-6149
2
0.70
References 
Authors
5
3
Name
Order
Citations
PageRank
Masatsune Tamura110715.26
Tatsuya Mizutani2132.79
Takehiko Kagoshima3428.66