Title
Bisection and twisted SVD on GPU
Abstract
Singular value decomposition (SVD) is one of the most important factorizations in matrix computation. However, computing SVD is still time-consuming, especially when the dimension of matrices exceeds tens of thousands. In this paper, we present a high performance approach called “Bisection and Twisted” (BT) for solving bidiagonal SVD. As modern general purpose GPUs have shown their extreme computational advantages in parallel computing, we implement the BT algorithm on single and multiple GPUs. With our carefully designed GPU kernels, the BT algorithm is about 10 times faster than MKL divide-and-conquer routine DBDSDC on an 8-core 2.53GHz CPU, and 36 times faster than CULA QR routine DBDSQR on the same GPUs. Additionally, the BT algorithm is able to compute SVD for matrices of size 1 million by 1 million with only two GPUs. To the best of our knowledge, no implementation has achieved such a scale.
Year
DOI
Venue
2015
10.1109/HPEC.2015.7322479
2015 IEEE High Performance Extreme Computing Conference (HPEC)
Keywords
Field
DocType
bisection and twisted SVD,singular value decomposition,matrix factorizations,high performance approach,bidiagonal SVD,general purpose GPU,parallel computing,BT algorithm,GPU kernels
Kernel (linear algebra),Singular value decomposition,Algorithm design,Bisection,Instruction set,Computer science,Matrix (mathematics),Matrix decomposition,Parallel computing,Numerical linear algebra
Conference
ISSN
Citations 
PageRank 
2377-6943
0
0.34
References 
Authors
5
7
Name
Order
Citations
PageRank
Lu He161.15
Yan Luo214819.24
Rui Liu341.46
Hengyong Yu429335.54
Yu Cao510014.01
Xuzhou Chen6183.55
Seung Woo Son729631.43