Abstract | ||
---|---|---|
Digitised Cultural Heritage (CH) items usually have short descriptions and lack rich contextual information. Wikipedia articles, on the contrary, include in-depth descriptions and links to related articles, which motivate the enrichment of CH items with information from Wikipedia. In this paper we explore the feasibility of finding matching articles in Wikipedia for a given Cultural Heritage item. We manually annotated a random sample of items from Europeana, and performed a qualitative and quantitative study of the issues and problems that arise, showing that each kind of CH item is different and needs a nuanced definition of what "matching article" means. In addition, we test a well-known wikification (aka entity linking) algorithm on the task. Our results indicate that a substantial number of items can be effectively linked to their corresponding Wikipedia article. |
Year | Venue | Keywords |
---|---|---|
2012 | LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | Cultural Heritage,Corpus annotation,Wikification |
DocType | Citations | PageRank |
Conference | 11 | 0.76 |
References | Authors | |
9 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Eneko Agirre | 1 | 3119 | 217.33 |
Ander Barrena | 2 | 30 | 5.04 |
Oier Lopez De Lacalle | 3 | 382 | 25.08 |
Aitor Soroa | 4 | 1121 | 59.72 |
Samuel Fernando | 5 | 87 | 12.71 |
Mark Stevenson | 6 | 970 | 91.03 |