Knowledge-driven event extraction in Russian: corpus-based linguistic resources - Citegraph

Paper Info

Title
Knowledge-driven event extraction in Russian: corpus-based linguistic resources

Abstract
AbstractAutomatic event extraction form text is an important step in knowledge acquisition and knowledge base population. Manual work in development of extraction system is indispensable either in corpus annotation or in vocabularies and pattern creation for a knowledge-based system. Recent works have been focused on adaptation of existing system (for extraction from English texts) to new domains. Event extraction in other languages was not studied due to the lack of resources and algorithms necessary for natural language processing. In this paper we define a set of linguistic resources that are necessary in development of a knowledge-based event extraction system in Russian: a vocabulary of subordination models, a vocabulary of event triggers, and a vocabulary of Frame Elements that are basic building blocks for semantic patterns. We propose a set of methods for creation of such vocabularies in Russian and other languages using Google Books NGram Corpus. The methods are evaluated in development of event extraction system for Russian.

Year	DOI	Venue
2016	10.1155/2016/4183760	Periodicals
Field	DocType	Volume
Population,Annotation,Computer science,Natural language processing,Knowledge extraction,Artificial intelligence,Knowledge base,Linguistics,Vocabulary,Knowledge acquisition	Journal	2016
Issue	ISSN	Citations
1	1687-5265	3
PageRank	References	Authors
0.52	13	2

Authors (2 rows)

Cited by (3 rows)

References (13 rows)

Name	Order	Citations	PageRank
Valery Solovyev	1	38	10.57
Vladimir Ivanov	2	30	11.48

1