Title
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing.
Abstract
We present a new version of the Croatian Dependency Treebank. It departs slightly from the Prague Dependency Treebank syntactic layer annotation guidelines, which were previously followed closely, by introducing a new subset of syntactic tags on top of the existing tagset. These new tags are used for the explicit annotation of subordinate clauses via subordinate conjunctions. Along with introducing the new annotation to the Croatian Dependency Treebank, we also modify the head attachment rules for subordinate conjunctions and subordinate clause predicates. In an experiment with data-driven dependency parsing, we show that implementing these new annotation guidelines leads to a statistically significant improvement in parsing accuracy. We also observe a substantial improvement in inter-annotator agreement, facilitating more consistent annotation in further treebank development.
Year
2014
Venue
LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
Keywords
dependency treebank, dependency parsing, Croatian language
Field
Annotation, Computer science, Dependency grammar, Treebank, Artificial intelligence, Natural language processing, Parsing, Dependent clause, Predicate (grammar), Syntax
DocType
Conference
Citations
0
PageRank
0.34
References
11
Authors
4
Name | Order | Citations | PageRank
Zeljko Agic | 1 | 159 | 20.44
Daša Berović | 2 | 12 | 1.77
Danijela Merkler | 3 | 23 | 2.66
Marko Tadić | 4 | 80 | 15.61