Title |
---|
Toponym disambiguation in historical documents using semantic and geographic features |
Abstract |
---|
Historians are often interested in the locations mentioned in digitized collections. However, place names are highly ambiguous and may change over time, which makes it especially hard to automatically ground mentions of places in historical texts to their real-world referents. Toponym disambiguation is a challenging problem in natural language processing and has been approached through two different yet related tasks: toponym resolution and entity linking. In this paper, we propose a weakly-supervised method that combines the strengths of both approaches by exploiting geographic and semantic features. We tested our method against a historical toponym resolution benchmark and improved on the state of the art. We also created five datasets, tested the performance of two state-of-the-art out-of-the-box entity linking methods, and improved on their performance when only locations are considered. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1145/3078081.3078099 | DATeCH |
DocType | ISBN | Citations |
---|---|---|
Conference | 978-1-4503-5265-9 | 0 |
PageRank | References | Authors |
---|---|---|
0.34 | 10 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Mariona Coll Ardanuy | 1 | 0 | 0.34 |
Caroline Sporleder | 2 | 453 | 31.84 |