Title | ||
---|---|---|
A novel hybrid mandarin speech synthesis system using different base units for model training and concatenation |
Abstract | ||
---|---|---|
The hybrid speech synthesis system, which uses the acoustic model trained according to the criterion of Maximum Likelihood to select the proper candidates from the corpus, has become a hot topic in recent days. For this hybrid system, the performance is affected by the size of the base training unit and the base candidate unit. Most of existed hybrid systems use the same kind of base unit such as syllable or phone for both model training and concatenation. In Mandarin, initials and finals form the fundamental elements of pronunciation, and are always chosen as the base training unit for statistical parametric TTS system. In this paper a new hybrid Mandarin TTS system is proposed, which uses initial/final for model training and syllable for concatenation. Objective and subjective evaluations are conducted and the comparison results show that the hybrid system we proposed outperforms the traditional systems which use the same base unit for both processes with 4000 and 6000 sentences' corpus. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1109/ICASSP.2014.6853605 | ICASSP |
Keywords | Field | DocType |
mandarin speech synthesis,concatenation,hybrid speech synthesis system,acoustic model,maximum likelihood estimation,hmm,syllable,speech synthesis,maximum likelihood,model training,hidden markov models,speech,acoustics | Speech synthesis,SI base unit,Computer science,Speech recognition,Concatenation,Syllable,Hidden Markov model,Hybrid system,Mandarin Chinese,Acoustic model | Conference |
ISSN | Citations | PageRank |
1520-6149 | 1 | 0.37 |
References | Authors | |
6 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ran Zhang | 1 | 3 | 0.84 |
Jianhua Tao | 2 | 848 | 138.00 |
Ya Li | 3 | 40 | 3.68 |
Zhengqi Wen | 4 | 86 | 24.41 |