Title
Concept-Enhanced Multi-view Co-clustering of Document Data.
Abstract
The maturity of structured knowledge bases and semantic resources has contributed to the enhancement of document clustering algorithms, that may take advantage of conceptual representations as an alternative for classic bag-of-words models. However, operating in the semantic space is not always the best choice in those domain where the choice of terms also matters. Moreover, users are usually required to provide a valid number of clusters as input, but this parameter is often hard to guess, due to the exploratory nature of the clustering process. To address these limitations, we propose a multi-view co-clustering approach that processes simultaneously the classic document-term matrix and an enhanced document-concept representation of the same collection of documents. Our algorithm has multiple key-features: it finds an arbitrary number of clusters and provides clusters of terms and concepts as easy-to-interpret summaries. We show the effectiveness of our approach in an extensive experimental study involving several corpora with different levels of complexity.
Year
DOI
Venue
2017
10.1007/978-3-319-60438-1_45
Lecture Notes in Artificial Intelligence
Keywords
Field
DocType
Co-clustering,Semantic enrichment,Multi-view clustering
Data mining,Cluster (physics),Information retrieval,Computer science,Document clustering,Matrix (mathematics),Consensus clustering,Conceptual clustering,Biclustering,Cluster analysis,Brown clustering
Conference
Volume
ISSN
Citations 
10352
0302-9743
0
PageRank 
References 
Authors
0.34
11
2
Name
Order
Citations
PageRank
Valentina Rho1354.42
Ruggero G. Pensa235431.20