Title
Entity resolution using search engine results
Abstract
Given a set of automatically extracted entities E of size n, we would like to cluster all the various names referring to the same canonical entity together. The variations of each entity include acronyms, full name, and informal naming conventions. We propose using search engine results to cluster variations of each entity based on the URLs appearing in those results. We create a cluster C for each top search result returned by querying for the entity e ∈ E assigning e to the cluster C. Our experiments on a manually created dataset shows that our approach achieves higher precision and recall than string matching algorithm and hierarchical clustering based disambiguation methods.
Year
DOI
Venue
2012
10.1145/2396761.2398641
CIKM
Keywords
Field
DocType
dataset shows,entities e,entity resolution,e assigning e,top search result,search engine result,cluster variation,cluster c.,disambiguation method,canonical entity,cluster c,search engines
String searching algorithm,Hierarchical clustering,Data mining,Name resolution,Search engine,Information retrieval,Computer science,Precision and recall
Conference
Citations 
PageRank 
References 
0
0.34
8
Authors
3
Name
Order
Citations
PageRank
Madian Khabsa123718.81
Pucktada Treeratpituk217711.12
C. Lee Giles3111541549.48