Title
Adaptive Wavenet Vocoder for Residual Compensation in GAN-Based Voice Conversion.
Abstract
In this paper, we propose to use generative adversarial networks (GAN) together with a WaveNet vocoder to address the over-smoothing problem arising from the deep learning approaches to voice conversion, and to improve the vocoding quality over the traditional vocoders. As GAN aims to minimize the divergence between the natural and converted speech parameters, it effectively alleviates the over-smoothing problem in the converted speech. On the other hand, WaveNet vocoder allows us to leverage from the human speech of a large speaker population, thus improving the naturalness of the synthetic voice. Furthermore, for the first time, we study how to use WaveNet vocoder for residual compensation to improve the voice conversion performance. The experiments show that the proposed voice conversion framework consistently outperforms the baselines.
Year
DOI
Venue
2018
10.1109/SLT.2018.8639507
SLT
Keywords
Field
DocType
Vocoders,Training,Gallium nitride,Generative adversarial networks,Training data,Speech enhancement
Training set,Speech enhancement,Population,Residual,Computer science,Naturalness,Speech recognition,Artificial intelligence,Deep learning
Conference
ISSN
ISBN
Citations 
2639-5479
978-1-5386-4334-1
8
PageRank 
References 
Authors
0.43
0
5
Name
Order
Citations
PageRank
Berrak Sisman16010.34
Mingyang Zhang210410.61
Sakriani Sakti325765.02
Haizhou Li43678334.61
Satoshi Nakamura51099194.59