Title
Context features based pre-selection and weight prediction in concatenation speech synthesis system
Abstract
How to generate natural-sounding synthesized speech has been challenging all the researchers in speech synthesis area. Experiments show that speech concatenated by units selected from large speech corpus has a better performance. However how to limit the searching space and predict weights when calculating target cost is an important problem. This paper presents a detailed hierarchical pre-selection method to limit the searching of space. After three layers of pre-selection, a set of units are selected as the candidate units. In order to ensure the continuity in the duration, the prediction model is used in the hierarchical pre-selection. Meanwhile, M5P algorithm which is combined with decision tree and regression is presented in this paper to predict weights needed in target cost calculation. Experimental result shows that these two approaches can generate high quality speech.
Year
DOI
Venue
2014
10.1109/ISCSLP.2014.6936611
ISCSLP
Keywords
Field
DocType
weight prediction,natural-sounding synthesized speech,speech corpus,high quality speech,context feature based preselection,regression analysis,concatenation speech synthesis,speech synthesis,hierarchical pre-selection method,hierarchical pre-selection,concatenation speech synthesis system,text analysis,decision tree,decision trees,searching space,m5p algorithm
Speech corpus,Decision tree,Target costing,Speech synthesis,Pattern recognition,Regression,Computer science,Speech recognition,Artificial intelligence,Concatenation,Linear predictive coding
Conference
Citations 
PageRank 
References 
0
0.34
3
Authors
5
Name
Order
Citations
PageRank
Shanfeng Liu131.42
Zhengqi Wen28624.41
Ya Li311715.01
Jianhua Tao4848138.00
Bin Liu552.45