Title
Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory.
Abstract
We present low-resource spoken keyword search (KWS) strategies guided by distinctive feature theory in linguistics to conduct data selection, feature selection, and transcription augmentation. These strategies were employed in the context of the 2016 NIST Open Keyword Search Evaluation (OpenKWS16) using conversational Georgian from the IARPA Babel program. In particular, we elaborate on the following: (1) We exploit glottal-source-related acoustic features that characterize Georgian ejective phonemes ([+constricted glottis], [+raised larynx ejective] specified in distinctive feature theory). These features complement standard acoustic features, leading to a relative fusion gain of 11.9%. (2) We use noisy channel models to incorporate probabilistic phonetic transcriptions from mismatched crowdsourcing to conduct transfer learning to improve KWS for extremely under-resourced conditions (24 min of transcribed Georgian), achieving a relative improvement of 118% over the baseline and a relative fusion gain of 32%.(3) Using distinctive feature analysis, we select a compact subset of source languages used in past evaluations to ensure high phonetic coverage for cross-lingual acoustic modeling when only limited system development time and computational resources are available. This strategy leads to comparable performance to using all available linguistic resources when only 1/3 of the source languages were chosen.
Year
Venue
Field
2017
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference
Georgian,Transcription (linguistics),Feature selection,Computer science,Crowdsourcing,Transfer of learning,Feature extraction,Natural language processing,Distinctive feature,Artificial intelligence,Probabilistic logic
DocType
ISSN
Citations 
Conference
2309-9402
1
PageRank 
References 
Authors
0.35
0
13
Name
Order
Citations
PageRank
Nancy F. Chen112028.98
Boon Pang Lim2628.89
Van Hai Do310.68
Van Tung Pham4408.42
Chongjia Ni5408.63
Haihua Xu65511.41
Mark Hasegawa-Johnson71189112.85
Wenda Chen853.13
Xiong Xiao928134.97
Sunil Sivadas1016919.71
Eng Siong Chng11970106.33
Bin Ma12605.69
Haizhou Li133678334.61