Title
Reducing microarray data via nonnegative matrix factorization for visualization and clustering analysis
Abstract
In microarray data analysis, each gene expression sample has thousands of genes and reducing such high dimensionality is useful for both visualization and further clustering of samples. Traditional principal component analysis (PCA) is a commonly used method which has problems. Nonnegative Matrix Factorization (NMF) is a new dimension reduction method. In this paper we compare NMF and PCA for dimension reduction. The reduced data is used for visualization, and clustering analysis via k-means on 11 real gene expression datasets. Before the clustering analysis, we apply NMF and PCA for reduction in visualization. The results on one leukemia dataset show that NMF can discover natural clusters and clearly detect one mislabeled sample while PCA cannot. For clustering analysis via k-means, NMF most typically outperforms PCA. Our results demonstrate the superiority of NMF over PCA in reducing microarray data. (C) 2007 Elsevier Inc. All rights reserved.
Year
DOI
Venue
2008
10.1016/j.jbi.2007.12.003
JOURNAL OF BIOMEDICAL INFORMATICS
Keywords
Field
DocType
microarray data,Nonnegative Matrix Factorization,principal component analysis,visualization,clustering analysis
Data mining,Clustering high-dimensional data,Dimensionality reduction,Pattern recognition,Visualization,Computer science,Curse of dimensionality,Microarray analysis techniques,Artificial intelligence,Non-negative matrix factorization,Cluster analysis,Principal component analysis
Journal
Volume
Issue
ISSN
41
4
1532-0464
Citations 
PageRank 
References 
16
0.74
8
Authors
3
Name
Order
Citations
PageRank
Weixiang Liu110512.26
Yuan Kehong2528.00
Ye Datian34110.06