Title
Online Distributed Estimation of Principal Eigenspaces
Abstract
Principal components analysis (PCA) is a widely used dimension reduction technique with an extensive range of applications. In this paper, an online distributed algorithm is proposed for recovering the principal eigenspaces. We further establish its rate of convergence and show how it relates to the number of nodes employed in the distributed computation, the effective rank of the data matrix under consideration, and the gap in the spectrum of the underlying population covariance matrix. The proposed algorithm is illustrated on low-rank approximation and k-means clustering tasks. The numerical results show a substantial computational speed-up vis-a-vis standard distributed PCA algorithms, without compromising learning accuracy.
Year
DOI
Venue
2019
10.1109/DSW.2019.8755554
2019 IEEE Data Science Workshop (DSW)
Keywords
Field
DocType
distributed estimation,principal components analysis,local and global aggregations,streaming data
Population,Mathematical optimization,Dimensionality reduction,Algorithm,Distributed algorithm,Rate of convergence,Covariance matrix,Cluster analysis,Mathematics,Principal component analysis,Computation
Journal
Volume
ISBN
Citations 
abs/1905.07389
978-1-7281-0709-7
0
PageRank 
References 
Authors
0.34
3
3