Title
Building support tools for Russian-language information extraction
Abstract
There is currently a paucity of publicly available NLP tools to support analysis of Russian-language text. This especially concerns higher-level applications, such as Information Extraction. We present work on tools for information extraction from text in Russian in the domain of on-line news. On the lower level we employ the AOT toolkit for natural language processing, which provides modules for morphological analysis and partial syntactic chunking. Since the outputs of both lower-level modules contain unresolved ambiguity, we synthesize the outputs and pass the result into a pre-existing English-language analysis pipeline. We describe how the information extraction system is adapted for multilingual support, including extensions to the ontologies and to the pattern matching mechanism. While this is work in progress, we present an end-to-end pipeline for event extraction from Russian-language news.
Year
DOI
Venue
2011
10.1007/978-3-642-23538-2_48
TSD
Keywords
Field
DocType
multilingual support,russian-language information extraction,pre-existing english-language analysis pipeline,end-to-end pipeline,russian-language text,information extraction system,building support tool,on-line news,information extraction,morphological analysis,event extraction,russian-language news
Computer science,Artificial intelligence,Natural language processing,Chunking (psychology),Syntax,Ambiguity,Ontology (information science),Information retrieval,Work in process,Speech recognition,Information extraction,Pattern matching,Rule of inference
Conference
Volume
ISSN
Citations 
6836
0302-9743
5
PageRank 
References 
Authors
0.49
7
6
Name
Order
Citations
PageRank
Mian Du1122.83
Peter von Etter2322.36
Mikhail Kopotev3182.14
Mikhail Novikov450.49
Natalia Tarbeeva550.49
Roman Yangarber641162.85