Tibetan Base Noun Phrase Identification Framework Based on Chinese-Tibetan Sentence Aligned Corpus. - Citegraph

Paper Info

Title
Tibetan Base Noun Phrase Identification Framework Based on Chinese-Tibetan Sentence Aligned Corpus.

Abstract
This paper presents an identification framework for extracting Tibetan base noun phrase (NP). The framework includes two phases. In the first phase, Chinese base NPs are extracted from all Chinese sentences in the sentence aligned Chinese-Tibetan corpus using Stanford Chinese parser. In the second phase, the Tibetan translations of those Chinese NPs are identified using four different methods, that is, word alignment, iterative re-evaluation, dictionary and word alignment, and sequence intersection method. We implemented and tested these methods on Chinese-Tibetan sentence aligned unlabelled corpus without Tibetan POS tagger and Treebank. The experimental results demonstrate these methods can get satisfactory results, and the best performance with 0.5283 precision is got using sequence intersection identification method. The identification framework can also be extended to extract Tibetan verb phrase. © 2012 The COLING.

Year	DOI	Venue
2012	null	COLING
Keywords	DocType	Volume
base noun phrase,head-phrase,tibetan information processing	Conference	null
Issue	ISSN	Citations
null	null	0
PageRank	References	Authors
0.34	0	6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Minghua Nuo	1	11	4.22
Huidan Liu	2	16	5.09
Weina Zhao	3	2	0.73
Long-Long Ma	4	6	1.72
Jian Wu	5	0	0.34
Zhiming Ding	6	348	38.93

1