Title
Atlas - The Multilingual Language Processing Platform
Abstract
This paper presents the ATLAS platform - multilingual language processing framework integrating the common set of linguistic tools for a group of European languages (less-resourced: Bulgarian, Croatian, Greek, Polish and Romanian together with English and German as reference languages). State-of-the-art NLP functionality offered by the platform allows for multilingual annotation of texts on lower levels (segmentation, morphosyntax) which in turn supports higher-level processing such as automated categorization, information extraction, machine translation or summarization. More elaborate annotation methods such as named entity extraction or multiword unit lemmatization are also available. Multilevel annotation of texts is governed by language processing chains constructed with UIMA (Unstructured Information Management Application) industry standard.To demonstrate capabilities of the framework, three linguistically-aware online services have been built on top of it: i-Publisher (Web-based content management platform), i-Librarian (a digital library of scientific works) and EUDocLib (site for browsing and searching through EUR-LEX documents).
Year
Venue
Keywords
2011
PROCESAMIENTO DEL LENGUAJE NATURAL
linguistic tools, language resources, Web services, content management system, online services, UIMA
Field
DocType
Volume
World Wide Web,Commission,Computer science,Information and Communications Technology,Web service,Content management system
Journal
47
Issue
ISSN
Citations 
47
1135-5948
0
PageRank 
References 
Authors
0.34
0
2
Name
Order
Citations
PageRank
Maciej Ogrodniczuk13811.22
Diman Karagiozov200.68