Title
Speech Synthesis for Error Training Models in CALL
Abstract
A computer assisted pronunciation teaching system (CAPT) is a fundamental component in a computer assisted language learning system (CALL). A speech recognition based CAPT system often requires a large amount of speech data to train the incorrect phone models in its speech recognizer. But collecting incorrectly pronounced speech data is a labor intensive and costly work. This paper reports an effort on training the incorrect phone models by making use of synthesized speech data. A special formant speech synthesizer is designed to filter the correctly pronounced phones into incorrect phones by modifying the formant frequencies. In a Chinese Putonghua CALL system for native Cantonese speakers to learn Mandarin, a small experimental CAPT system is built with a synthetic speech data trained recognizer. Evaluation shows that a CAPT system using synthesized data can perform as good as or even better than that using real data provided that the size of the synthetic data are large enough.
Year
DOI
Venue
2009
10.1007/978-3-642-00831-3_24
ICCPOL
Keywords
Field
DocType
speech recognition,capt system,speech recognizer,incorrect phone model,synthesized data,error training models,special formant speech synthesizer,speech data,pronounced speech data,synthesized speech data,speech synthesis,synthetic data
Speech corpus,Pronunciation,Speech synthesis,Computer science,Voice activity detection,Chinese speech synthesis,Speech recognition,Phone,Natural language processing,Artificial intelligence,Formant,Speech technology
Conference
Volume
ISSN
Citations 
5459
0302-9743
0
PageRank 
References 
Authors
0.34
4
8
Name
Order
Citations
PageRank
Xin Zhang159160.75
Qin Lu268966.45
Jiping Wan300.34
Guangguang Ma410.72
Tin-shing Chiu5173.67
Weiping Ye6152.44
Wenli Zhou712.07
Qiao Li800.34