Title
Grouped Text Clustering Using Non-Parametric Gaussian Mixture Experts.
Abstract
Text clustering has many applications in various areas. Before being clustered, texts often have already been grouped or partially grouped in practise. Texts from the same group are related to each other and concentrate on a few topics. The group information turns out to be valuable for text clustering. In this paper, we propose a model called Non-parametric Gaussian Mixture Experts to get better clustering result through utilizing group information. After converting texts to vectors by semantic embedding, our model can automatically infer proper cluster number for every group and the whole corpus. We develop an online variational inference algorithm which is scalable and can handle incremental datasets. Our algorithm is tested on various text datasets. The results demonstrate our model has significantly better performance in cluster quality than some other classical and recent text clustering methods.
Year
DOI
Venue
2016
10.1007/978-3-319-42911-3_42
PRICAI
Field
DocType
Volume
Data mining,Fuzzy clustering,Document clustering,Computer science,Artificial intelligence,Cluster analysis,Single-linkage clustering,Canopy clustering algorithm,Clustering high-dimensional data,Pattern recognition,Correlation clustering,Determining the number of clusters in a data set,Machine learning
Conference
9810
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
7
5
Name
Order
Citations
PageRank
Yong Tian110.70
Yu Rong201.01
Yuan Yao382.51
Weidong Liu49317.66
Jiaxing Song5509.62