Modulation Spectrum-Constrained Trajectory Training Algorithm For Gmm-Based Voice Conversion - Citegraph

Paper Info

Title
Modulation Spectrum-Constrained Trajectory Training Algorithm For Gmm-Based Voice Conversion

Abstract
This paper presents a novel training algorithm for Gaussian Mixture Model (GMM)-based Voice Conversion (VC). One advantage of GMM-based VC is its computationally efficient conversion processing, which makes real-time VC applications possible. On the other hand, the quality of the converted speech is still significantly worse than that of natural speech. To address this problem while preserving the computationally efficient conversion processing, the proposed training method makes it possible 1) to use a consistent optimization criterion between training and conversion and 2) to compensate the Modulation Spectrum (MS) of the converted parameter trajectory, a feature sensitively correlated with the over-smoothing effects that degrade the quality of the converted speech. The experimental results demonstrate that the proposed algorithm yields significant improvements in terms of both the converted speech quality and the conversion accuracy for speaker individuality compared to the basic training algorithm.
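
To make the link between over-smoothing and the modulation spectrum more concrete, the following is a minimal sketch and not the paper's training algorithm. It assumes the MS of a parameter trajectory is the log-scaled power spectrum of that sequence over time (the abstract does not give the definition), and it uses a moving average as a crude, hypothetical stand-in for the over-smoothing introduced by conversion. The function names, the synthetic trajectory, and the FFT length are all illustrative choices.

```python
import cmath
import math
import random


def modulation_spectrum(trajectory, n_fft=64):
    """Log power spectrum of one parameter trajectory (assumed MS definition)."""
    # Truncate or zero-pad the trajectory to exactly n_fft frames.
    padded = list(trajectory[:n_fft]) + [0.0] * max(0, n_fft - len(trajectory))
    spectrum = []
    for k in range(n_fft // 2 + 1):  # non-negative modulation frequencies only
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n_fft)
            for t, x in enumerate(padded)
        )
        spectrum.append(math.log(abs(coeff) ** 2 + 1e-10))  # floor avoids log(0)
    return spectrum


def moving_average(trajectory, width=5):
    """Crude stand-in for over-smoothing: centered moving average."""
    half = width // 2
    smoothed = []
    for t in range(len(trajectory)):
        window = trajectory[max(0, t - half): t + half + 1]
        smoothed.append(sum(window) / len(window))
    return smoothed


# Synthetic "natural" trajectory: a slow oscillation plus frame-level detail.
random.seed(0)
natural = [
    math.sin(2 * math.pi * t / 32) + 0.3 * random.gauss(0, 1) for t in range(64)
]
smoothed = moving_average(natural)

ms_natural = modulation_spectrum(natural)
ms_smoothed = modulation_spectrum(smoothed)

# Mean MS over the upper half of the modulation-frequency bins.
high_natural = sum(ms_natural[16:]) / len(ms_natural[16:])
high_smoothed = sum(ms_smoothed[16:]) / len(ms_smoothed[16:])
print(high_natural, high_smoothed)
```

Under these assumptions, the smoothed trajectory shows a lower MS at high modulation frequencies than the unsmoothed one, which is the kind of degradation an MS-based constraint would aim to compensate.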

Year	DOI	Venue
2015	10.1109/ICASSP.2015.7178894	2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP)
Keywords	Field	DocType
GMM-based voice conversion, over-smoothing, modulation spectrum, training algorithm	Hafnium, Pragmatics, Pattern recognition, Computer science, Speech quality, Algorithm, Speech recognition, Artificial intelligence, Mixture model, Trajectory, Modulation spectrum	Conference
ISSN	Citations	PageRank
1520-6149	7	0.44
References	Authors
19	4

Authors (4 rows)

Name	Order	Citations	PageRank
Shinnosuke Takamichi	1	75	22.08
Tomoki Toda	2	1874	167.18
Alan W. Black	3	4391	742.28
Satoshi Nakamura	4	1099	194.59
