Title
Polish language processing chains for multilingual information systems
Abstract
The ATLAS project, started in March 2010, intends to create a multilingual language processing framework integrating the common set of linguistic tools for a group of European languages, among them Polish. The chained tools producing multi-level UIMA-encoded annotation of texts can be used by NLP applications for complex language-intensive operations such as automated categorization, information extraction, machine translation or summarization. This paper concentrates on applications of ATLAS language processing chains to multilingual information systems, with particular interest in processing Polish. Inflectional characteristics of this language offers the possibility to comment on a few more advanced functions such as multiword unit lemmatisation, vital for real-life presentation of extracted phrases. Several sample applications using the NLP chain are also presented.
Year
DOI
Venue
2012
10.1007/978-3-642-31178-9_14
NLDB
Keywords
Field
DocType
atlas language processing chain,european language,nlp chain,multilingual language processing framework,inflectional characteristic,polish language processing chain,information system,advanced function,atlas project,nlp application,multilingual information system,information extraction
Noun phrase,Information system,Automatic summarization,Lemmatisation,Latent Dirichlet allocation,Computer science,Machine translation,Polish,Information extraction,Artificial intelligence,Natural language processing
Conference
Citations 
PageRank 
References 
0
0.34
10
Authors
2
Name
Order
Citations
PageRank
Maciej Ogrodniczuk13811.22
Adam Przepiórkowski217930.37