Title
A novel hybrid mandarin speech synthesis system using different base units for model training and concatenation
Abstract
The hybrid speech synthesis system, which uses the acoustic model trained according to the criterion of Maximum Likelihood to select the proper candidates from the corpus, has become a hot topic in recent days. For this hybrid system, the performance is affected by the size of the base training unit and the base candidate unit. Most of existed hybrid systems use the same kind of base unit such as syllable or phone for both model training and concatenation. In Mandarin, initials and finals form the fundamental elements of pronunciation, and are always chosen as the base training unit for statistical parametric TTS system. In this paper a new hybrid Mandarin TTS system is proposed, which uses initial/final for model training and syllable for concatenation. Objective and subjective evaluations are conducted and the comparison results show that the hybrid system we proposed outperforms the traditional systems which use the same base unit for both processes with 4000 and 6000 sentences' corpus.
Year
DOI
Venue
2014
10.1109/ICASSP.2014.6853605
ICASSP
Keywords
Field
DocType
mandarin speech synthesis,concatenation,hybrid speech synthesis system,acoustic model,maximum likelihood estimation,hmm,syllable,speech synthesis,maximum likelihood,model training,hidden markov models,speech,acoustics
Speech synthesis,SI base unit,Computer science,Speech recognition,Concatenation,Syllable,Hidden Markov model,Hybrid system,Mandarin Chinese,Acoustic model
Conference
ISSN
Citations 
PageRank 
1520-6149
1
0.37
References 
Authors
6
4
Name
Order
Citations
PageRank
Ran Zhang130.84
Jianhua Tao2848138.00
Ya Li3403.68
Zhengqi Wen48624.41