Title | ||
---|---|---|
Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory. |
Abstract | ||
---|---|---|
We present low-resource spoken keyword search (KWS) strategies guided by distinctive feature theory in linguistics to conduct data selection, feature selection, and transcription augmentation. These strategies were employed in the context of the 2016 NIST Open Keyword Search Evaluation (OpenKWS16) using conversational Georgian from the IARPA Babel program. In particular, we elaborate on the following: (1) We exploit glottal-source-related acoustic features that characterize Georgian ejective phonemes ([+constricted glottis], [+raised larynx ejective] specified in distinctive feature theory). These features complement standard acoustic features, leading to a relative fusion gain of 11.9%. (2) We use noisy channel models to incorporate probabilistic phonetic transcriptions from mismatched crowdsourcing to conduct transfer learning to improve KWS for extremely under-resourced conditions (24 min of transcribed Georgian), achieving a relative improvement of 118% over the baseline and a relative fusion gain of 32%.(3) Using distinctive feature analysis, we select a compact subset of source languages used in past evaluations to ensure high phonetic coverage for cross-lingual acoustic modeling when only limited system development time and computational resources are available. This strategy leads to comparable performance to using all available linguistic resources when only 1/3 of the source languages were chosen. |
Year | Venue | Field |
---|---|---|
2017 | Asia-Pacific Signal and Information Processing Association Annual Summit and Conference | Georgian,Transcription (linguistics),Feature selection,Computer science,Crowdsourcing,Transfer of learning,Feature extraction,Natural language processing,Distinctive feature,Artificial intelligence,Probabilistic logic |
DocType | ISSN | Citations |
Conference | 2309-9402 | 1 |
PageRank | References | Authors |
0.35 | 0 | 13 |
Name | Order | Citations | PageRank |
---|---|---|---|
Nancy F. Chen | 1 | 120 | 28.98 |
Boon Pang Lim | 2 | 62 | 8.89 |
Van Hai Do | 3 | 1 | 0.68 |
Van Tung Pham | 4 | 40 | 8.42 |
Chongjia Ni | 5 | 40 | 8.63 |
Haihua Xu | 6 | 55 | 11.41 |
Mark Hasegawa-Johnson | 7 | 1189 | 112.85 |
Wenda Chen | 8 | 5 | 3.13 |
Xiong Xiao | 9 | 281 | 34.97 |
Sunil Sivadas | 10 | 169 | 19.71 |
Eng Siong Chng | 11 | 970 | 106.33 |
Bin Ma | 12 | 60 | 5.69 |
Haizhou Li | 13 | 3678 | 334.61 |