Building HMM based unit-selection speech synthesis system using synthetic speech naturalness evaluation score | 3 | 0.67 | 2011 |
Cross-Validation and Minimum Generation Error based Decision Tree Pruning for HMM-based Speech Synthesis | 0 | 0.34 | 2010 |
HMM-based pseudo-clean speech synthesis for splice algorithm | 5 | 0.46 | 2010 |
An automatic language identification method based on subspace analysis | 1 | 0.35 | 2009 |
Full covariance state duration modeling for HMM-based speech synthesis | 2 | 0.49 | 2009 |
A new method for mispronunciation detection using Support Vector Machine based on Pronunciation Space Models | 26 | 1.41 | 2009 |
Integrating articulatory features into HMM-based parametric speech synthesis | 46 | 2.20 | 2009 |
Investigation on Adaptation Using Different Discriminative Training Criteria Based Linear Regression and MAP | 0 | 0.34 | 2008 |
Articulatory Control Of HMM-Based Parametric Speech Synthesis Driven By Phonetic Knowledge | 16 | 1.03 | 2008 |
Automatic mispronunciation detection for Mandarin | 9 | 0.94 | 2008 |
Double Gauss Based Unsupervised Score Normalization in Speaker Verification | 1 | 0.36 | 2008 |
Cross-Stream Dependency Modeling For HMM-Based Speech Synthesis | 3 | 0.42 | 2008 |
Soft Margin Estimation With Various Separation Levels For LVCSR | 1 | 0.39 | 2008 |
Exploiting Non-Target Region Information for Confidence Measure Based on Bayesian Information Criterion | 0 | 0.34 | 2008 |
Minimum word classification error training of HMMs for automatic speech recognition | 6 | 0.54 | 2008 |
Minimum generation error criterion considering global/local variance for HMM-based speech synthesis | 12 | 0.86 | 2008 |
Minimum generation error linear regression based model adaptation for HMM-based speech synthesis | 6 | 0.67 | 2008 |
Model Adaptation for HMM-Based Speech Synthesis under Minimum Generation Error Criterion | 0 | 0.34 | 2008 |
Minimum unit selection error training for HMM-based unit selection speech synthesis system | 3 | 0.45 | 2008 |
Heteroscedastic discriminant analysis with two-dimensional constraints | 1 | 0.36 | 2008 |
Pronunciation Space Models for Pronunciation Evaluation | 0 | 0.34 | 2008 |
Tone Evaluation of Chinese Continuous Speech Based on Prosodic Words | 1 | 0.40 | 2008 |
Angle of Models Distance as Test Algorithm in Speaker Verification | 0 | 0.34 | 2007 |
Performance of Discriminative HMM Training in Noise | 0 | 0.34 | 2007 |
An efficient automatic video shot size annotation scheme | 2 | 0.44 | 2007 |
A study on soft margin estimation for LVCSR | 7 | 0.60 | 2007 |
An Interactive Video Annotation Framework with Multiple Modalities | 0 | 0.34 | 2007 |
Supervised learning approach to optimize ranking function for Chinese FAQ-finder | 4 | 0.38 | 2007 |
HMM-based emotional speech synthesis using average emotion model | 7 | 0.63 | 2006 |
Noisy speech recognition performance of discriminative HMMs | 1 | 0.35 | 2006 |
Minimum Generation Error Training for HMM-Based Speech Synthesis | 70 | 3.93 | 2006 |
Signal trajectory based noise compensation for robust speech recognition | 0 | 0.34 | 2006 |
State Divergence-Based Determination of The Number of Gaussian Components of Each State in HMM | 1 | 0.37 | 2006 |
Video Annotation by Active Learning and Semi-Supervised Ensembling | 5 | 0.43 | 2006 |
Emotional speech synthesis based on improved codebook mapping voice conversion | 2 | 0.36 | 2005 |
Optimal Clustering And Non-Uniform Allocation Of Gaussian Kernels In Scalar Dimension For HMM Compression | 6 | 0.70 | 2005 |
A novel source analysis method by matching spectral characters of LF model with STRAIGHT spectrum | 7 | 0.54 | 2005 |
Discriminative training and explicit duration modeling for HMM-based automatic segmentation | 2 | 0.39 | 2005 |
Maximum likelihood sub-band adaptation for robust speech recognition | 2 | 0.47 | 2005 |
Sliding Window Smoothing For Maximum Entropy Based Intonational Phrase Prediction In Chinese | 3 | 0.46 | 2005 |
Discriminative Training Based on the Criterion of Least Phone Competing Tokens for Large Vocabulary Speech Recognition | 2 | 0.48 | 2005 |
Region Based Multiple Frame-Rate Tradeoff Of Video Streaming | 3 | 0.46 | 2004 |
Perceptual video streaming by adaptive spatial-temporal scalability | 2 | 0.38 | 2004 |
Modeling Glottal Effect On The Spectral Envelop Of STRAIGHT Using Mixture Of Gaussians | 2 | 0.49 | 2004 |
A novel voice conversion system based on codebook mapping with phoneme-tied weighting | 6 | 0.64 | 2004 |
Chinese prosody phrase break prediction based on maximum entropy model | 14 | 0.93 | 2004 |
MCE-Based Training Of Subspace Distribution Clustering HMM | 0 | 0.34 | 2004 |
A Superposed Prosodic Model For Chinese Text-To-Speech Synthesis | 8 | 0.83 | 2004 |
Compression of speech database by feature separation and pattern clustering using STRAIGHT | 1 | 0.40 | 2004 |
A comparative study on various confidence measures in large vocabulary speech recognition | 16 | 1.02 | 2004 |