Title
Tibetan Base Noun Phrase Identification Framework Based on Chinese-Tibetan Sentence Aligned Corpus.
Abstract
This paper presents a framework for identifying Tibetan base noun phrases (NPs). The framework has two phases. In the first phase, Chinese base NPs are extracted from all Chinese sentences in a sentence-aligned Chinese-Tibetan corpus using the Stanford Chinese parser. In the second phase, the Tibetan translations of those Chinese NPs are identified using four different methods: word alignment, iterative re-evaluation, dictionary and word alignment, and sequence intersection. We implemented and tested these methods on an unlabelled Chinese-Tibetan sentence-aligned corpus, without a Tibetan POS tagger or treebank. The experimental results show that these methods achieve satisfactory results; the best performance, a precision of 0.5283, is obtained with the sequence intersection method. The identification framework can also be extended to extract Tibetan verb phrases. © 2012 The COLING.
Year: 2012
DOI: null
Venue: COLING
Keywords: base noun phrase, head-phrase, tibetan information processing
DocType: Conference
Volume: null
Issue: null
ISSN: null
Citations: 0
PageRank: 0.34
References: 0
Authors: 6
Name            Order  Citations  PageRank
Minghua Nuo     1      11         4.22
Huidan Liu      2      16         5.09
Weina Zhao      3      2          0.73
Long-Long Ma    4      6          1.72
Jian Wu         5      0          0.34
Zhiming Ding    6      348        38.93