Scalable Concatenative Speech Synthesis Based On The Plural Unit Selection And Fusion Method - Citegraph

Paper Info

Title
Scalable Concatenative Speech Synthesis Based On The Plural Unit Selection And Fusion Method

Abstract
Recently, concatenative speech synthesizers with large databases have been widely developed for high-quality speech synthesis. However, some platforms require a speech synthesis system that can work within a limited memory footprint or computational cost. In this paper, we propose a scalable concatenative speech synthesizer based on the plural speech unit selection and fusion method. To realize scalability, we propose an offline unit fusion method in which pitch-cycle waveforms for voiced segments are fused in advance. The experimental results show that the synthetic speech of the offline unit fusion method with a half-size waveform database is comparable to that of the online unit fusion method, while the computational cost is reduced to 1/10.
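
The difference between online and offline unit fusion described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the fusion rule, so sample-wise averaging of equal-length pitch-cycle waveforms is assumed here, and the names `fuse`, `build_offline_database`, `synthesize_online`, and `synthesize_offline` are hypothetical.

```python
from typing import Dict, List

Waveform = List[float]


def fuse(waveforms: List[Waveform]) -> Waveform:
    """Fuse equal-length pitch-cycle waveforms into one waveform.

    ASSUMPTION: fusion is modeled as sample-wise averaging; the abstract
    does not specify the actual fusion rule.
    """
    if not waveforms:
        raise ValueError("need at least one waveform to fuse")
    length = len(waveforms[0])
    if any(len(w) != length for w in waveforms):
        raise ValueError("waveforms must have equal length")
    return [sum(samples) / len(waveforms) for samples in zip(*waveforms)]


def build_offline_database(groups: Dict[str, List[Waveform]]) -> Dict[str, Waveform]:
    """Offline step: fuse each group once and keep only the fused result."""
    return {unit_id: fuse(candidates) for unit_id, candidates in groups.items()}


def synthesize_online(groups: Dict[str, List[Waveform]], unit_ids: List[str]) -> List[Waveform]:
    """Online fusion: all candidates stay in memory and are fused per request."""
    return [fuse(groups[unit_id]) for unit_id in unit_ids]


def synthesize_offline(database: Dict[str, Waveform], unit_ids: List[str]) -> List[Waveform]:
    """Offline fusion: synthesis is a lookup of waveforms fused in advance."""
    return [database[unit_id] for unit_id in unit_ids]


if __name__ == "__main__":
    # Toy data: two hypothetical unit groups with two-sample "waveforms".
    groups = {"a": [[1.0, 3.0], [3.0, 5.0]], "b": [[0.0, 2.0]]}
    database = build_offline_database(groups)
    print(synthesize_offline(database, ["a", "b"]))
```

In this sketch the offline database stores one fused waveform per group instead of every candidate, and synthesis does no averaging at run time. This is the kind of memory and computation trade-off the abstract reports, though the figures in the abstract come from the authors' own system and not from this toy model.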

Year	2005
DOI	10.1109/ICASSP.2005.1415125
Venue	2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING
Keywords	natural languages, cost function, sensor fusion, speech synthesis, scalability
Field	Speech processing, Speech synthesis, Pattern recognition, Computer science, Voice activity detection, Waveform, Sensor fusion, Speech recognition, Artificial intelligence, Memory footprint, Scalability, Computation
DocType	Conference
ISSN	1520-6149
Citations	2
PageRank	0.70
References	5
Authors	3

Authors (3 rows)

Name	Order	Citations	PageRank
Masatsune Tamura	1	107	15.26
Tatsuya Mizutani	2	13	2.79
Takehiko Kagoshima	3	42	8.66