Title
Things and Strings: Improving Place Name Disambiguation from Short Texts by Combining Entity Co-Occurrence with Topic Modeling.
Abstract
Place name disambiguation is the task of correctly identifying a place from a set of places sharing a common name. It contributes to tasks such as knowledge extraction, query answering, geographic information retrieval, and automatic tagging. Disambiguation quality relies on the ability to correctly identify and interpret contextual clues, complicating the task for short texts. Here we propose a novel approach to the disambiguation of place names from short texts that integrates two models: entity co-occurrence and topic modeling. The first model uses Linked Data to identify related entities to improve disambiguation quality. The second model uses topic modeling to differentiate places based on the terms used to describe them. We evaluate our approach using a corpus of short texts, determine the suitable weight between models, and demonstrate that a combined model outperforms benchmark systems such as DBpedia Spotlight and Open Calais in terms of F1-score and Mean Reciprocal Rank.
Year
DOI
Venue
2016
10.1007/978-3-319-49004-5_23
EKAW
Keywords
Field
DocType
Place name disambiguation,Natural language processing,LDA,Wikipedia,DBpedia,Linked Data
Data mining,Common name,Computer science,Linked data,Artificial intelligence,Natural language processing,Name disambiguation,Information retrieval,Geographic information retrieval,Co-occurrence,Mean reciprocal rank,Knowledge extraction,Topic model
Conference
Volume
ISSN
Citations 
10024
0302-9743
3
PageRank 
References 
Authors
0.37
15
6
Name
Order
Citations
PageRank
Yiting Ju1292.33
Benjamin Adams2737.78
Krzysztof Janowicz31660105.59
Yingjie Hu441739.76
Bo Yan5517.96
Grant McKenzie612313.85