Abstract | ||
---|---|---|
Constrained clustering is attracting attention as a useful way for grouping a data set into intended clusters using users' feedback. We develop an interactive document clustering method by employing constrained clustering in order to group patent documents into some technological categories based on their contents. This method aims to progressively improve the accuracy of clustering to repeat both the assigning of appropriate cluster to documents and applying constrained clustering. We evaluate how many documents our method needs in order to reach an adequate accuracy and which document should be given to accomplish the desired result in fewer assignments. Moreover, by repeating both the assigning and clustering, it comes to the point at which the clustering accuracy is improved by just only the number of documents given true. We propose an approach to predict such a point based on the amount of cluster label changes in the K-Means loop. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1145/1651343.1651347 | Proceedings of the 2nd international workshop on Patent information retrieval |
Keywords | DocType | Citations |
fewer assignment,patent document set,cluster label change,clustering accuracy,appropriate cluster,intended cluster,stability of clustering,interactive document,k-means loop,adequate accuracy,group patent document,constrained clustering,patent map,method need,k means,document clustering | Conference | 2 |
PageRank | References | Authors |
0.37 | 2 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yusuke Sato | 1 | 2 | 0.71 |
Makoto Iwayama | 2 | 436 | 87.03 |