Title
Dependency Parsing With Lattice Structures For Resource-Poor Languages
Abstract
In this paper, we present a new dependency parsing method for languages which have very small annotated corpus and for which methods of segmentation and morphological analysis producing a unique (automatically disambiguated) result are very unreliable. Our method works on a morphosyntactic lattice factorizing all possible segmentation and part-of-speech tagging results. The quality of the input to syntactic analysis is hence much better than that of an unreliable unique sequence of lemmatized and tagged words. We propose an adaptation of Eisner's algorithm for finding the k-best dependency trees in a morphosyntactic lattice structure encoding multiple results of morphosyntactic analysis. Moreover, we present how to use Dependency Insertion Grammar in order to adjust the scores and filter out invalid trees, the use of language model to rescore the parse trees and the k-best extension of our parsing model. The highest parsing accuracy reported in this paper is 74.32% which represents a 6.31% improvement compared to the model taking the input from the unreliable morphosyntactic analysis tools.
Year
DOI
Venue
2009
10.1587/transinf.E92.D.2122
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
Keywords
Field
DocType
dependency parser, under-resourced languages, morphosyntactic lattice structure, Dependency Insertion Grammar, k-best parsing
Annotation,Computer science,Segmentation,Speech recognition,Grammar,Coding (social sciences),Dependency grammar,Artificial intelligence,Natural language processing,Parsing,Language model,Encoding (memory)
Journal
Volume
Issue
ISSN
E92D
10
1745-1361
Citations 
PageRank 
References 
1
0.39
13
Authors
4
Name
Order
Citations
PageRank
Sutee Sudprasert110.39
asanee kawtrakul216125.90
Christian Boitet324350.18
Vincent Berment4103.47