Title | ||
---|---|---|
Towards Multilingual Event Extraction Evaluation: A Case Study for the Czech Language. |
Abstract | ||
---|---|---|
This paper presents a multilingual corpus of news, annotated with event metadata information. The events in our corpus are from the domain of violence, natural and man made disasters. The main goal of the corpus is automatic evaluation of event detection and extraction systems in different languages. As a use case, we take a rulebased event extraction system, extend it to cover a new language, Czech in our case, and evaluate it on the corpus. We explain what needs to be done to cover a new language, especially learning domain-specific dictionaries and event extraction patterns. The evaluation of the Czech system can be viewed as a starting point for further research into the evaluation of multilingual event extraction systems, which is an important stage during the development of such systems. The comparison of the performance for the Czech and English systems indicates the importance for multilingual event extraction evaluation. |
Year | Venue | Field |
---|---|---|
2015 | RANLP | Metadata,Czech,Computer science,Natural language processing,Artificial intelligence,Constructed language,Man-Made Disasters |
DocType | Citations | PageRank |
Conference | 0 | 0.34 |
References | Authors | |
4 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
josef steinberger | 1 | 355 | 26.95 |
Hristo Tanev | 2 | 456 | 51.18 |