Title
Towards Multilingual Event Extraction Evaluation: A Case Study for the Czech Language.
Abstract
This paper presents a multilingual corpus of news articles annotated with event metadata. The events in our corpus come from the domains of violence and of natural and man-made disasters. The main goal of the corpus is the automatic evaluation of event detection and extraction systems in different languages. As a use case, we take a rule-based event extraction system, extend it to cover a new language, Czech in our case, and evaluate it on the corpus. We explain what needs to be done to cover a new language, especially learning domain-specific dictionaries and event extraction patterns. The evaluation of the Czech system can be viewed as a starting point for further research into the evaluation of multilingual event extraction systems, which is an important stage in the development of such systems. A comparison of the performance of the Czech and English systems indicates the importance of multilingual event extraction evaluation.
Year: 2015
Venue: RANLP
Field: Metadata, Czech, Computer science, Natural language processing, Artificial intelligence, Constructed language, Man-Made Disasters
DocType: Conference
Citations: 0
PageRank: 0.34
References: 4
Authors: 2
Name: Josef Steinberger; Order: 1; Citations: 355; PageRank: 26.95
Name: Hristo Tanev; Order: 2; Citations: 456; PageRank: 51.18