Speech Synthesis for Error Training Models in CALL - Citegraph

Paper Info

Title
Speech Synthesis for Error Training Models in CALL

Abstract
A computer assisted pronunciation teaching system (CAPT) is a fundamental component in a computer assisted language learning system (CALL). A speech recognition based CAPT system often requires a large amount of speech data to train the incorrect phone models in its speech recognizer. But collecting incorrectly pronounced speech data is a labor intensive and costly work. This paper reports an effort on training the incorrect phone models by making use of synthesized speech data. A special formant speech synthesizer is designed to filter the correctly pronounced phones into incorrect phones by modifying the formant frequencies. In a Chinese Putonghua CALL system for native Cantonese speakers to learn Mandarin, a small experimental CAPT system is built with a synthetic speech data trained recognizer. Evaluation shows that a CAPT system using synthesized data can perform as good as or even better than that using real data provided that the size of the synthetic data are large enough.

Year	DOI	Venue
2009	10.1007/978-3-642-00831-3_24	ICCPOL
Keywords	Field	DocType
speech recognition,capt system,speech recognizer,incorrect phone model,synthesized data,error training models,special formant speech synthesizer,speech data,pronounced speech data,synthesized speech data,speech synthesis,synthetic data	Speech corpus,Pronunciation,Speech synthesis,Computer science,Voice activity detection,Chinese speech synthesis,Speech recognition,Phone,Natural language processing,Artificial intelligence,Formant,Speech technology	Conference
Volume	ISSN	Citations
5459	0302-9743	0
PageRank	References	Authors
0.34	4	8

Authors (8 rows)

Cited by (0 rows)

References (4 rows)

Name	Order	Citations	PageRank
Xin Zhang	1	591	60.75
Qin Lu	2	689	66.45
Jiping Wan	3	0	0.34
Guangguang Ma	4	1	0.72
Tin-shing Chiu	5	17	3.67
Weiping Ye	6	15	2.44
Wenli Zhou	7	1	2.07
Qiao Li	8	0	0.34

1