Title
Towards a more careful evaluation of broad coverage parsing systems
Abstract
Since treebanks have become available to researchers a wide variety of techniques has been used to make broad coverage parsing systems. This makes quantitative evaluation very important, but the current evaluation methods have a number of drawbacks such as arbitrary choices in the treebank and the difficulty in measuring statistical significance. We suggest a more detailed method for testing a parsing system using constituent boundaries, with a number of measures that give more information than current measures, and evaluate the quality of the test. We also show that statistical significance cannot be calculated in a straightforward way, and suggest a calculation method for the case of Bracket Recall.
Year
DOI
Venue
1996
10.3115/992628.992725
COLING
Keywords
Field
DocType
constituent boundary,detailed method,current measure,arbitrary choice,calculation method,current evaluation method,careful evaluation,parsing system,broad coverage,statistical significance,quantitative evaluation
Computer science,Treebank,Artificial intelligence,Natural language processing,Parsing,Recall
Conference
Volume
Citations 
PageRank 
C96-1
1
0.39
References 
Authors
6
2
Name
Order
Citations
PageRank
Wide R. Hogenhout182.31
yuji matsumoto23008300.05