Title | ||
---|---|---|
Tibetan Base Noun Phrase Identification Framework Based on Chinese-Tibetan Sentence Aligned Corpus. |
Abstract | ||
---|---|---|
This paper presents a framework for identifying Tibetan base noun phrases (NPs). The framework has two phases. In the first phase, Chinese base NPs are extracted from all Chinese sentences in a sentence-aligned Chinese-Tibetan corpus using the Stanford Chinese parser. In the second phase, the Tibetan translations of those Chinese NPs are identified using four different methods: word alignment, iterative re-evaluation, dictionary and word alignment, and sequence intersection. We implemented and tested these methods on an unlabelled Chinese-Tibetan sentence-aligned corpus, without a Tibetan POS tagger or treebank. The experimental results show that these methods give satisfactory results; the best performance, a precision of 0.5283, is obtained with the sequence intersection method. The framework can also be extended to extract Tibetan verb phrases. © 2012 The COLING. |
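
The abstract names a sequence intersection method but does not describe it. The sketch below is only one plausible reading, not the authors' algorithm: for a given Chinese NP, collect the Tibetan side of every sentence pair whose Chinese side contains that NP, then take the longest contiguous token run shared by all of those Tibetan sentences as the candidate translation. The function names (`contains_run`, `longest_shared_run`, `candidate_translation`) and the placeholder tokens (`z1`, `t1`, ...) are hypothetical and stand in for real segmented Chinese and Tibetan words.

```python
from typing import List, Sequence, Set, Tuple

Pair = Tuple[List[str], List[str]]  # (Chinese tokens, Tibetan tokens)


def contains_run(tokens: Sequence[str], run: Sequence[str]) -> bool:
    """True if `run` occurs as a contiguous subsequence of `tokens`."""
    n, m = len(tokens), len(run)
    return any(list(tokens[i:i + m]) == list(run) for i in range(n - m + 1))


def contiguous_runs(tokens: Sequence[str]) -> Set[Tuple[str, ...]]:
    """All non-empty contiguous token runs of one sentence, as tuples."""
    n = len(tokens)
    return {tuple(tokens[i:j]) for i in range(n) for j in range(i + 1, n + 1)}


def longest_shared_run(sentences: Sequence[Sequence[str]]) -> Tuple[str, ...]:
    """Longest contiguous run present in every sentence, or () if none."""
    if not sentences:
        return ()
    shared = contiguous_runs(sentences[0])
    for sent in sentences[1:]:
        shared &= contiguous_runs(sent)
    # Ties on length are broken lexicographically so the result is stable.
    return max(shared, key=lambda r: (len(r), r), default=())


def candidate_translation(chinese_np: Sequence[str],
                          pairs: Sequence[Pair]) -> Tuple[str, ...]:
    """Assumed reading of 'sequence intersection': intersect the Tibetan
    sides of all sentence pairs whose Chinese side contains the NP."""
    tibetan_side = [tib for zh, tib in pairs if contains_run(zh, chinese_np)]
    return longest_shared_run(tibetan_side)


# Toy corpus with placeholder tokens (not real Chinese or Tibetan).
pairs = [
    (["z1", "z2", "z3"], ["t5", "t1", "t2", "t9"]),
    (["z4", "z2", "z3"], ["t1", "t2", "t7"]),
    (["z6"], ["t8"]),
]
print(candidate_translation(["z2", "z3"], pairs))
```

Under this reading, an NP that occurs in only one sentence pair would return the whole Tibetan sentence, and frequent function words shared by every sentence can be picked up by mistake, so a practical version would need a minimum occurrence count and some filtering.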
Year | DOI | Venue |
---|---|---|
2012 | null | COLING |
Keywords | DocType | Volume |
base noun phrase, head-phrase, Tibetan information processing | Conference | null |
Issue | ISSN | Citations |
null | null | 0 |
PageRank | References | Authors |
0.34 | 0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Minghua Nuo | 1 | 11 | 4.22 |
Huidan Liu | 2 | 16 | 5.09 |
Weina Zhao | 3 | 2 | 0.73 |
Long-Long Ma | 4 | 6 | 1.72 |
Jian Wu | 5 | 0 | 0.34 |
Zhiming Ding | 6 | 348 | 38.93 |