Title
Woefzela - An Open-Source Platform For Asr Data Collection In The Developing World
Abstract
Building transcribed speech corpora for under-resourced languages plays a pivotal role in developing speech technologies for such languages. We have developed an open-source tool for devices running the Android operating system to facilitate the efficient collection of speech data for Automatic Speech Recognition system development. The tool was designed for use in typical developing-world conditions; we present the relevant design choices and analyse the effectiveness of this tool by means of a case study. In particular, we introduce a novel semi-real-time quality monitoring system, which increases the efficiency of the data collection process.
Year
Venue
Keywords
2011
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5
speech resource collection, automatic speech recognition, developing world, resource-scarce environment, under-resourced languages, android
Field
DocType
Citations 
Data collection,Speech analytics,Android (operating system),Monitoring system,Computer science,Developing country,Speech recognition,System development
Conference
11
PageRank 
References 
Authors
0.98
1
5
Name
Order
Citations
PageRank
Nic J. De Vries1302.32
Jaco Badenhorst2454.77
Marelie H. Davel323622.70
Etienne Barnard443857.85
Alta de Waal5425.68