Title
Pairwise Constraints-Guided Non-negative Matrix Factorization for Document Clustering
Abstract
Nonnegative Matrix Factorization (NMF) has been proven to be effective in text mining. However, since NMF is a well-known unsupervised components analysis technique, the existing NMF method can not deal with prior constraints, which are beneficial to clustering or classification tasks. In this paper, we address the text clustering problem via a novel strategy, called Pairwise Constraintsguided Non-negative Matrix Factorization (PCNMF for short). Differing from the traditional NMF method, the proposed method can capture the available abundance prior constraints in original space, which result in more effective for clustering or information retrieval. Therefore, PCNMF enforces the discriminative capability in the reduced space. Utilizing the appropriate transformation, PCNMF represents as a new optimization problem, which can be efficiently solved by an iterative approach. The cluster membership of each document can be easily determined as the standard NMF. Empirical studies based on Benchmark document corpus demonstrate appealing results.
Year
DOI
Venue
2007
10.1109/WI.2007.66
Web Intelligence
Keywords
Field
DocType
traditional nmf method,pairwise constraints-guided non-negative matrix,pairwise constraintsguided non-negative matrix,available abundance prior constraint,benchmark document corpus,original space,nonnegative matrix factorization,standard nmf,new optimization problem,document clustering,existing nmf method,optimization problem,clustering,data mining,empirical study,internet,text clustering,non negative matrix factorization,text mining,pattern recognition,iterative methods,information retrieval,automation,feature extraction
Data mining,Computer science,Document clustering,Artificial intelligence,Cluster analysis,Discriminative model,Optimization problem,Pairwise comparison,Information retrieval,Pattern recognition,Iterative method,Matrix decomposition,Non-negative matrix factorization
Conference
ISBN
Citations 
PageRank 
978-0-7695-3026-0
2
0.38
References 
Authors
15
2
Name
Order
Citations
PageRank
Yang Yu-Jiu18919.30
Hu Bao-Gang2138683.23