Abstract | ||
---|---|---|
Building transcribed speech corpora for under-resourced languages plays a pivotal role in developing speech technologies for such languages. We have developed an open-source tool for devices running the Android operating system to facilitate the efficient collection of speech data for Automatic Speech Recognition system development. The tool was designed for use in typical developing-world conditions; we present the relevant design choices and analyse the effectiveness of this tool by means of a case study. In particular, we introduce a novel semi-real-time quality monitoring system, which increases the efficiency of the data collection process. |
Year | Venue | Keywords |
---|---|---|
2011 | 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | speech resource collection, automatic speech recognition, developing world, resource-scarce environment, under-resourced languages, android |
Field | DocType | Citations |
Data collection,Speech analytics,Android (operating system),Monitoring system,Computer science,Developing country,Speech recognition,System development | Conference | 11 |
PageRank | References | Authors |
0.98 | 1 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Nic J. De Vries | 1 | 30 | 2.32 |
Jaco Badenhorst | 2 | 45 | 4.77 |
Marelie H. Davel | 3 | 236 | 22.70 |
Etienne Barnard | 4 | 438 | 57.85 |
Alta de Waal | 5 | 42 | 5.68 |