Abstract | ||
---|---|---|
Text-to-speech synthesis system has been widely studied for many languages. However, speech synthesis for Arabic language has not sufficient progresses and it is still in its first stage. Statistical parametric synthesis based on hidden Markov models was the most commonly applied approach for Arabic language. Recently, synthesized speech quality based on deep neural networks was found as intelligible as human voice. This paper describes a Text-To-Speech (TTS) synthesis system for modern standard Arabic language based on statistical parametric approach and Mel-cepstral coefficients. Deep neural networks achieved state-of-the-art performance in a wide range of tasks, including speech synthesis. Our ITS system includes a diacritization system which is very important for Arabic TTS application. Our diacritization system is also based on deep neural networks. In addition to the use deep techniques, different methods were also proposed to model the acoustic parameters in order to address the problem of acoustic models accuracy. They are based on linguistic and acoustic characteristics (e.g. letter position based diacritization system, unit types based synthesis system, diacritic marks based synthesis system) and based on deep learning techniques (stacked generalization techniques). Experimental results show that our diacritization system can generate a diacritized text with high accuracy. As regards the speech synthesis system, the experimental results and subjective evaluation show that our proposed method for synthesis system can generate intelligible and natural speech. (C) 2015 Elsevier Ltd. All rights reserved. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1016/j.csl.2015.04.002 | Computer Speech & Language |
Keywords | Field | DocType |
Text-to-speech synthesis,Statistical parametric,Deep neural networks,Natural language processing,Diacritization system | Speech synthesis,Human voice,Unit type,Computer science,Speech recognition,Parametric statistics,Modern Standard Arabic,Diacritic,Artificial intelligence,Natural language processing,Deep learning,Hidden Markov model | Journal |
Volume | Issue | ISSN |
34 | 1 | 0885-2308 |
Citations | PageRank | References |
3 | 0.48 | 17 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
ilyes rebai | 1 | 13 | 3.65 |
Yassine BenAyed | 2 | 19 | 2.58 |