Esophageal Speech Enhancement Based On Statistical Voice Conversion With Gaussian Mixture Models - Citegraph

Paper Info

Title
Esophageal Speech Enhancement Based On Statistical Voice Conversion With Gaussian Mixture Models

Abstract
This paper presents a novel method of enhancing esophageal speech using statistical voice conversion. Esophageal speech is one of the alternative speaking methods for laryngectomees. Although it doesn't require any external devices, generated voices usually sound unnatural compared with normal speech. To improve the intelligibility and naturalness of esophageal speech, we propose a voice conversion method from esophageal speech into normal speech. A spectral parameter and excitation parameters of target normal speech are separately estimated from a spectral parameter of the esophageal speech based on Gaussian mixture models. The experimental results demonstrate that the proposed method yields significant improvements in intelligibility and naturalness. We also apply one-to-many eigenvoice conversion to esophageal speech enhancement to make it possible to flexibly control the voice quality of enhanced speech.

Year	DOI	Venue
2010	10.1587/transinf.E93.D.2472	IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
Keywords	Field	DocType
laryngectomees, esophageal speech, speech enhancement, voice conversion, eigenvoice conversion	Speech enhancement,Speech processing,Voice analysis,Esophageal speech,Pattern recognition,Computer science,Voice activity detection,PSQM,Speech recognition,Artificial intelligence,Linear predictive coding,Intelligibility (communication)	Journal
Volume	Issue	ISSN
E93D	9	1745-1361
Citations	PageRank	References
15	0.77	7
Authors
5

Authors (5 rows)

Cited by (15 rows)

References (7 rows)

Name	Order	Citations	PageRank
Hironori Doi	1	45	3.34
Keigo Nakamura	2	103	7.60
Tomoki Toda	3	1874	167.18
Saruwatari, H.	4	652	90.81
Kiyohiro Shikano	5	2662	928.81

1