Abstract | ||
---|---|---|
This paper presents the ATLAS platform - multilingual language processing framework integrating the common set of linguistic tools for a group of European languages (less-resourced: Bulgarian, Croatian, Greek, Polish and Romanian together with English and German as reference languages). State-of-the-art NLP functionality offered by the platform allows for multilingual annotation of texts on lower levels (segmentation, morphosyntax) which in turn supports higher-level processing such as automated categorization, information extraction, machine translation or summarization. More elaborate annotation methods such as named entity extraction or multiword unit lemmatization are also available. Multilevel annotation of texts is governed by language processing chains constructed with UIMA (Unstructured Information Management Application) industry standard.To demonstrate capabilities of the framework, three linguistically-aware online services have been built on top of it: i-Publisher (Web-based content management platform), i-Librarian (a digital library of scientific works) and EUDocLib (site for browsing and searching through EUR-LEX documents). |
Year | Venue | Keywords |
---|---|---|
2011 | PROCESAMIENTO DEL LENGUAJE NATURAL | linguistic tools, language resources, Web services, content management system, online services, UIMA |
Field | DocType | Volume |
World Wide Web,Commission,Computer science,Information and Communications Technology,Web service,Content management system | Journal | 47 |
Issue | ISSN | Citations |
47 | 1135-5948 | 0 |
PageRank | References | Authors |
0.34 | 0 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Maciej Ogrodniczuk | 1 | 38 | 11.22 |
Diman Karagiozov | 2 | 0 | 0.68 |