Abstract | ||
---|---|---|
A text-to-speech synthesizer that would produce easily understandable voices at very fast speaking rates is expected to help persons with visual disability to acquire information effectively with screen reading softwares. We investigated the intelligibility of Japanese Text-to-Speech systems at fast speaking rates, using four-digit random numbers as the vocabulary of the recall test.We also studied the fast and intelligible text-to-speech engine, using HMM-based synthesizer with the corpus with fast speaking rate. As the results.. the statistical models trained with the fast speaking corpus was effective. The learning effect was significant in the early stage of the trials and the effect sustained for several weeks. |
Year | DOI | Venue |
---|---|---|
2006 | 10.1109/IEMBS.2006.260473 | 2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15 |
Keywords | DocType | Volume |
speech intelligibility,statistical models,speech synthesis,hidden markov models,text to speech,learning effect,statistical model,listening | Conference | 1 |
ISSN | Citations | PageRank |
1557-170X | 4 | 0.65 |
References | Authors | |
0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Takuya Nishimoto | 1 | 227 | 28.95 |
Shinji Sako | 2 | 181 | 15.69 |
Shigeki Sagayama | 3 | 1217 | 137.97 |
Kazue Ohshima | 4 | 4 | 0.65 |
Koichi Oda | 5 | 4 | 0.65 |
Takayuki Watanabe | 6 | 28 | 5.52 |