Title
A case study on using speech-to-translation alignments for language documentation.
Abstract
For many low-resource or endangered languages, spoken language resources are more likely to be annotated with translations than with transcriptions. Recent work exploits such annotations to produce speech-to-translation alignments, without access to any text transcriptions. We investigate whether providing such information can aid in producing better (mismatched) crowdsourced transcriptions, which in turn could be valuable for training speech recognition systems, and show that they can indeed be beneficial through a small-scale case study as a proof-of-concept. We also present a simple phonetically aware string averaging technique that produces transcriptions of higher quality.
Year
DOI
Venue
2017
10.18653/v1/W17-0123
arXiv: Computation and Language
Field
DocType
Volume
Transcription (linguistics),Computer science,Speech recognition,Exploit,Natural language processing,Language documentation,Artificial intelligence,Spoken language
Journal
abs/1702.04372
Citations 
PageRank 
References 
4
0.49
10
Authors
2
Name
Order
Citations
PageRank
Antonios Anastasopoulos112217.13
David Chiang22843144.76