Title
Multi-view document clustering based on geometrical similarity measurement
Abstract
Numerous works have applied multi-view clustering algorithms to document clustering. A challenging problem in document clustering is the choice of similarity metric. Existing multi-view document clustering methods have broadly used two measures: cosine similarity (CS) and Euclidean distance (ED). The first does not consider the magnitude difference (MD) between two vectors. The second cannot capture the divergence between pairs of vectors that have the same ED. In this paper, we propose five new similarity metrics. This approach overcomes the drawbacks of CS and ED by computing the divergence between documents with the same ED while also taking their magnitudes into account. Furthermore, we propose a multi-view document clustering scheme based on the proposed similarity metrics. First, CS, ED, the triangle's area similarity and sector's area similarity metric, and our five similarity metrics are applied to every view of a dataset to generate the corresponding similarity matrices. Next, we run clustering algorithms on these similarity matrices to evaluate single-view performance. Finally, we aggregate these similarity matrices into a unified similarity matrix and apply a spectral clustering algorithm to it to generate the final clusters. The experimental results show that the proposed similarity functions measure the similarity between documents more accurately than the existing metrics, and that the proposed clustering scheme considerably outperforms state-of-the-art algorithms.
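The two limitations named in the abstract, and the per-view aggregation step, can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not define the five proposed metrics, so plain cosine similarity stands in for them, and element-wise averaging is an assumed aggregation rule. The toy vectors and all function names are hypothetical, and the final spectral clustering step is omitted.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between vectors a and b (ignores magnitude)."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))

def euclidean_distance(a, b):
    """Straight-line distance between a and b (ignores angle)."""
    return math.dist(a, b)

def similarity_matrix(docs, sim):
    """Pairwise similarity matrix for one view of the dataset."""
    return [[sim(p, q) for q in docs] for p in docs]

def aggregate(matrices):
    """Element-wise mean of per-view matrices (assumed aggregation rule)."""
    n = len(matrices[0])
    return [[sum(m[i][j] for m in matrices) / len(matrices)
             for j in range(n)] for i in range(n)]

# CS limitation: vectors of very different magnitude still score about 1.0.
cs_same_direction = cosine_similarity((1, 1), (3, 3))

# ED limitation: both pairs are sqrt(2) apart, yet their angles differ widely.
pair_a = ((1, 0), (0, 1))              # orthogonal vectors
pair_b = ((10, 0), (10, math.sqrt(2)))  # nearly parallel vectors

# Two toy "views" of the same three documents (hypothetical feature vectors).
view_1 = [(1, 0), (0.9, 0.1), (0, 1)]
view_2 = [(2, 1), (2, 1.2), (0.1, 3)]
unified = aggregate([similarity_matrix(v, cosine_similarity)
                     for v in (view_1, view_2)])

print(cs_same_direction)
print(euclidean_distance(*pair_a), euclidean_distance(*pair_b))
print(cosine_similarity(*pair_a), cosine_similarity(*pair_b))
print(unified)
```

In this toy data, documents 0 and 1 are close in both views, so their entry in the unified matrix is higher than the entry for documents 0 and 2. A unified matrix of this kind is what a spectral clustering algorithm would then take as its precomputed affinity input.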
Year
2022
DOI
10.1007/s13042-021-01295-8
Venue
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS
Keywords
Multi-view clustering, Ensemble clustering, Similarity measurement, Document clustering
DocType
Journal
Volume
13
Issue
3
ISSN
1868-8071
Citations
0
PageRank
0.34
References
0
Authors
5
Name                 Order  Citations  PageRank
Bassoma Diallo       1      2          2.42
Jie Hu               2      9          3.89
Tianrui Li           3      3176       191.76
Ghufran Ahmad Khan   4      1          0.69
Ahmed Saad Hussein   5      0          0.34