A novel hybrid mandarin speech synthesis system using different base units for model training and concatenation - Citegraph

Paper Info

Title
A novel hybrid mandarin speech synthesis system using different base units for model training and concatenation

Abstract
The hybrid speech synthesis system, which uses the acoustic model trained according to the criterion of Maximum Likelihood to select the proper candidates from the corpus, has become a hot topic in recent days. For this hybrid system, the performance is affected by the size of the base training unit and the base candidate unit. Most of existed hybrid systems use the same kind of base unit such as syllable or phone for both model training and concatenation. In Mandarin, initials and finals form the fundamental elements of pronunciation, and are always chosen as the base training unit for statistical parametric TTS system. In this paper a new hybrid Mandarin TTS system is proposed, which uses initial/final for model training and syllable for concatenation. Objective and subjective evaluations are conducted and the comparison results show that the hybrid system we proposed outperforms the traditional systems which use the same base unit for both processes with 4000 and 6000 sentences' corpus.

Year	DOI	Venue
2014	10.1109/ICASSP.2014.6853605	ICASSP
Keywords	Field	DocType
mandarin speech synthesis,concatenation,hybrid speech synthesis system,acoustic model,maximum likelihood estimation,hmm,syllable,speech synthesis,maximum likelihood,model training,hidden markov models,speech,acoustics	Speech synthesis,SI base unit,Computer science,Speech recognition,Concatenation,Syllable,Hidden Markov model,Hybrid system,Mandarin Chinese,Acoustic model	Conference
ISSN	Citations	PageRank
1520-6149	1	0.37
References	Authors
6	4

Authors (4 rows)

Cited by (1 rows)

References (6 rows)

Name	Order	Citations	PageRank
Ran Zhang	1	3	0.84
Jianhua Tao	2	848	138.00
Ya Li	3	40	3.68
Zhengqi Wen	4	86	24.41

1