Combining classification with clustering for web person disambiguation - Citegraph

Paper Info

Title
Combining classification with clustering for web person disambiguation

Abstract
Web Person Disambiguation is often conducted through clustering web documents to identify different namesakes for a given name. This paper presents a new key-phrased clustering method combined with a second step re-classification to identify outliers to improve cluster performance. For document clustering, the hierarchical agglomerative approach is conducted based on the vector space model which uses key phrases as the main feature. Outliers of cluster results are then identified through a centroids-based method. The outliers are then reclassified by the SVM classifier into the more appropriate clusters using a key phrase-based string kernel model as its feature space. The re-classification uses the clustering result in the first step as its training data so as to avoid the use of separate training data required by most classification algorithms. Experiments conducted on the WePS-2 dataset show that the algorithm based on key phrases is effective in improving the WPD performance.

Year	DOI	Venue
2012	10.1145/2187980.2188165	WWW (Companion Volume)
Keywords	Field	DocType
document clustering,centroids-based method,wpd performance,cluster performance,combining classification,clustering web document,web person disambiguation,clustering result,key phrase,key phrase-based string kernel,new key-phrased clustering method,appropriate cluster,vector space model,feature space,web personalization,svm,string kernel	Hierarchical clustering,Canopy clustering algorithm,Data mining,Fuzzy clustering,CURE data clustering algorithm,Correlation clustering,Pattern recognition,Computer science,Artificial intelligence,FLAME clustering,Cluster analysis,Brown clustering	Conference
Citations	PageRank	References
6	0.46	6
Authors
3

Authors (3 rows)

Cited by (6 rows)

References (6 rows)

Name	Order	Citations	PageRank
Jian Xu	1	26	5.23
Qin Lu	2	689	66.45
Zhengzhong Liu	3	37	7.69

1