Abstract | ||
---|---|---|
This paper presents the results of recent experiments on the application of string distance metrics to the problem of named entity lemmatisation in Polish. It extends our work in [1] by introducing new results for organisation names. Furthermore, the results presented here and in [2,3], which address the same topic, were used to carry out a comparative study of the average usefulness of the numerous examined string distance metrics for the lemmatisation of Polish named entities of various types. In particular, we focus on the lemmatisation of country names, organisation names and person names. |
Year | DOI | Venue |
---|---|---|
2007 | 10.1007/978-3-642-04235-5_36 | Human Language Technology. Challenges of the Information Society |
Keywords | DocType | Volume |
organisation name,entity lemmatisation,highly inflective languages,country name,recent experiment,string distance metrics,polish named-entities,named entities,comparative study,new result,lemmatisation,person name,average usefulness,distance metric | Conference | 5603 |
ISSN | Citations | PageRank |
0302-9743 | 2 | 0.41 |
References | Authors | |
13 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jakub Piskorski | 1 | 435 | 50.04 |
Marcin Sydow | 2 | 264 | 22.71 |
Karol Wieloch | 3 | 26 | 2.46 |