Title
An improved tone labeling and prediction method with non-uniform segmentation of F0 contour
Abstract
This paper proposes a tone labeling technique for speech synthesis in tonal languages. Non-uniform segmentation based on Viterbi alignment is introduced to determine the boundaries from which F0 symbols are derived; these symbols are used as tonal labels to eliminate the mismatch between tone patterns and the F0 contours of the training data. During context clustering, the tendency of adjacent F0 state distributions is captured by state-based phonetic trees. In the synthesis stage, the means of the tone model states are directly quantized to obtain the full tonal label. Both objective and subjective experimental results show that the proposed technique can improve the perceptual prosody of synthetic speech for non-professional speakers.
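The labeling idea in the abstract (segment the F0 contour non-uniformly, then turn each segment into a discrete F0 symbol) can be illustrated with a minimal sketch. It rests on several assumptions that are not taken from the paper: the segment boundaries are simply given as frame indices (the paper obtains them by Viterbi alignment, which is not reproduced here), quantization uses a uniform grid on log-F0, and five levels are used. The function names `segment_means` and `quantize` and all numeric values are hypothetical.

```python
import math


def segment_means(f0, boundaries):
    """Return the mean log-F0 of each segment.

    `boundaries` are frame indices that split `f0` into non-uniform segments.
    Frames with f0 <= 0 are treated as unvoiced and skipped (an assumption).
    """
    edges = [0] + list(boundaries) + [len(f0)]
    means = []
    for start, end in zip(edges[:-1], edges[1:]):
        voiced = [math.log(v) for v in f0[start:end] if v > 0]
        means.append(sum(voiced) / len(voiced) if voiced else None)
    return means


def quantize(means, lo, hi, levels=5):
    """Map each mean to an integer symbol in 1..levels on a uniform log-F0 grid.

    The symbol 0 marks a segment with no voiced frames.
    """
    step = (hi - lo) / levels
    symbols = []
    for m in means:
        if m is None:
            symbols.append(0)
        else:
            idx = int((m - lo) / step) + 1
            symbols.append(min(max(idx, 1), levels))  # clamp to the valid range
    return symbols


# Hypothetical rising contour in Hz, split into three unequal segments.
f0 = [200, 205, 210, 230, 260, 280, 285, 290]
boundaries = [3, 5]
means = segment_means(f0, boundaries)
label = quantize(means, math.log(100), math.log(400), levels=5)
print(label)  # [3, 4, 4]
```

In the paper's setting the quantized values would be the means of tone model states rather than raw frame averages; the sketch only shows how a sequence of per-segment means becomes a symbolic tonal label.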
Year
2012
DOI
10.1109/ISCSLP.2012.6423467
Venue
ISCSLP
Keywords
tone pattern, f0 modeling, speech processing, perceptual prosody, tonal language speech synthesis, prediction method, f0 contour, synthetic speech, statistical analysis, nonprofessional speaker, state-based phonetic trees, statistical speech synthesis, nonuniform segmentation, speech synthesis, tone labeling, f0 generation, viterbi alignment, context clustering, f0 state distribution
Field
Prosody, Speech processing, Language speech, Speech synthesis, Sequence labeling, Pattern recognition, Computer science, Segmentation, Speech recognition, Artificial intelligence, Cluster analysis, Viterbi algorithm
DocType
Conference
Volume
null
Issue
null
ISBN
978-1-4673-2505-9
Citations
0
PageRank
0.34
References
6
Authors
4
Name
Order
Citations
PageRank
Xingyu Na1213.61
Xiang Xie221.39
Jingming Kuang338860.48
Ya-Ling He472.89