Rapid Development Of Tts Corpora For Four South African Languages - Citegraph

Paper Info

Title
Rapid Development Of Tts Corpora For Four South African Languages

Abstract
This paper describes the development of text-to-speech corpora for four South African languages. The approach followed investigated the possibility of using low-cost methods including informal recording environments and untrained volunteer speakers. This objective and the additional future goal of expanding the corpus to increase coverage of South Africa's 11 official languages necessitated experimenting with multi-speaker and code-switched data. The process and relevant observations are detailed throughout. The latest version of the corpora are available for download under an open-source licence and will likely sec further development and refinement in future.

Year	DOI	Venue
2017	10.21437/Interspeech.2017-1139	18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION
Keywords	Field	DocType
text-to-speech corpus, under-resourced languages	Computer science,Speech recognition,Languages of Africa,Linguistics	Conference
ISSN	Citations	PageRank
2308-457X	0	0.34
References	Authors
10	7

Authors (7 rows)

Cited by (0 rows)

References (10 rows)

Name	Order	Citations	PageRank
Daniel R. van Niekerk	1	18	4.69
Charl Johannes van Heerden	2	133	12.50
Marelie H. Davel	3	236	22.70
Neil Taylor Kleynhans	4	0	0.68
Oddur Kjartansson	5	6	4.89
Martin Jansche	6	257	23.92
Linne Ha	7	5	3.19

1