Title
Towards a comprehensive open repository of Polish language resources.
Abstract
The aim of this paper is to present current efforts towards the creation of a comprehensive open repository of Polish language resources and tools (LRTs). The work described here is carried out within the CESAR project, a member of the META-NET consortium. It has already resulted in the creation of the Computational Linguistics in Poland website, which contains an exhaustive collection of Polish LRTs. Current work focuses on the creation of new LRTs and, especially, the enhancement of existing ones, such as parallel corpora, annotated corpora of written and spoken Polish, and morphological dictionaries, to be made available via the META-SHARE repository. Efforts are made to ensure a high level of reusability of the LRTs by adhering to widely accepted annotation and interoperability standards. Last but not least, since the great majority of the Polish CESAR resources are released under open licenses, special work is required to clarify their Intellectual Property Rights status.
Year
2012
Venue
LREC 2012 - Eighth International Conference on Language Resources and Evaluation
Keywords
META-SHARE, parallel corpora, morphological dictionaries, annotated corpora, spoken corpora
Field
World Wide Web, Annotation, Interoperability, Computer science, Computational linguistics, Parallel corpora, Polish, Artificial intelligence, Natural language processing, Intellectual property, Reusability
DocType
Conference
Citations
0
PageRank
0.34
References
5
Authors
3
Name
Order
Citations
PageRank
Maciej Ogrodniczuk13811.22
P Pezik21419.07
Adam Przepiórkowski317930.37