Abstract | ||
---|---|---|
A computer-assisted pronunciation teaching (CAPT) system is a fundamental component of a computer-assisted language learning (CALL) system. A CAPT system based on speech recognition often requires a large amount of speech data to train the incorrect phone models in its speech recognizer, but collecting incorrectly pronounced speech data is labor-intensive and costly. This paper reports an effort to train the incorrect phone models using synthesized speech data. A special formant speech synthesizer is designed to convert correctly pronounced phones into incorrect phones by modifying their formant frequencies. Within a Putonghua CALL system that helps native Cantonese speakers learn Mandarin, a small experimental CAPT system is built with a recognizer trained on synthetic speech data. Evaluation shows that a CAPT system using synthesized data can perform as well as, or even better than, one using real data, provided that the synthetic data set is large enough. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1007/978-3-642-00831-3_24 | ICCPOL |
Keywords | Field | DocType |
speech recognition,capt system,speech recognizer,incorrect phone model,synthesized data,error training models,special formant speech synthesizer,speech data,pronounced speech data,synthesized speech data,speech synthesis,synthetic data | Speech corpus,Pronunciation,Speech synthesis,Computer science,Voice activity detection,Chinese speech synthesis,Speech recognition,Phone,Natural language processing,Artificial intelligence,Formant,Speech technology | Conference |
Volume | ISSN | Citations |
5459 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 4 | 8 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xin Zhang | 1 | 591 | 60.75 |
Qin Lu | 2 | 689 | 66.45 |
Jiping Wan | 3 | 0 | 0.34 |
Guangguang Ma | 4 | 1 | 0.72 |
Tin-shing Chiu | 5 | 17 | 3.67 |
Weiping Ye | 6 | 15 | 2.44 |
Wenli Zhou | 7 | 1 | 2.07 |
Qiao Li | 8 | 0 | 0.34 |