Title
Rapid Development of TTS Corpora for Four South African Languages
Abstract
This paper describes the development of text-to-speech corpora for four South African languages. The approach investigated the possibility of using low-cost methods, including informal recording environments and untrained volunteer speakers. This objective, together with the future goal of expanding the corpus to increase coverage of South Africa's 11 official languages, necessitated experimenting with multi-speaker and code-switched data. The process and relevant observations are detailed throughout. The latest versions of the corpora are available for download under an open-source licence and will likely see further development and refinement in future.
Year
2017
DOI
10.21437/Interspeech.2017-1139
Venue
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION
Keywords
text-to-speech corpus, under-resourced languages
Field
Computer science, Speech recognition, Languages of Africa, Linguistics
DocType
Conference
ISSN
2308-457X
Citations
0
PageRank
0.34
References
10
Authors
7