Title
Comparison of String Distance Metrics for Lemmatisation of Named Entities in Polish
Abstract
This paper presents the results of recent experiments on the application of string distance metrics to the problem of named entity lemmatisation in Polish. It extends our work in [1] by introducing new results for organisation names. Furthermore, the results presented here and in [2,3], which address the same topic, were used to make a comparative study of the average usefulness of the numerous examined string distance metrics for the lemmatisation of Polish named entities of various types. In particular, we focus on the lemmatisation of country names, organisation names and person names.
Year: 2007
DOI: 10.1007/978-3-642-04235-5_36
Venue: Human Language Technology. Challenges of the Information Society
Keywords: organisation name, entity lemmatisation, highly inflective languages, country name, recent experiment, string distance metrics, Polish named entities, named entities, comparative study, new result, lemmatisation, person name, average usefulness, distance metric
DocType: Conference
Volume: 5603
ISSN: 0302-9743
Citations: 2
PageRank: 0.41
References: 13
Authors: 3
Name              Order   Citations   PageRank
Jakub Piskorski   1       435         50.04
Marcin Sydow      2       264         22.71
Karol Wieloch     3       26          2.46