Title
High Performance Coordinate Descent Matrix Factorization for Recommender Systems.
Abstract
Coordinate descent (CD) has been proved to be an effective technique for matrix factorization (MF) in recommender systems. To speed up factorizing performance, various methods of implementing parallel CDMF have been proposed to leverage modern multi-core CPUs and many-core GPUs. Existing implementations are limited in either speed or portability (constrained to certain platforms). In this paper, we present an efficient and portable CDMF solver for recommender systems. On the one hand, we diagnose the baseline implementation and observe that it lacks the awareness of the hierarchical thread organization on modern hardware and the data variance of the rating matrix. Thus, we apply the thread batching technique and the load balancing technique to achieve high performance. On the other hand, we implement the CDMF solver in OpenCL so that it can run on various platforms. Based on the architectural specifics, we customize code variants to efficiently map them to the underlying hardware. The experimental results show that our implementation performs 2x faster on dual-socket Intel Xeon CPUs and 22x faster on an NVIDIA K20c GPU than the baseline implementations. When taking the CDMF solver as a benchmark, we observe that it runs 2.4x faster on the GPU than on the CPUs, whereas it achieves competitive performance on Intel MIC against the CPUs.
Year
DOI
Venue
2017
10.1145/3075564.3077625
Conf. Computing Frontiers
Keywords
Field
DocType
Matrix factorization, Coordinate descent, Performance, OpenCL
Recommender system,Computer science,Xeon Phi,Parallel computing,Matrix decomposition,Real-time computing,Software portability,Solver,Coordinate descent,Xeon,Speedup
Conference
Citations 
PageRank 
References 
2
0.36
31
Authors
6
Name
Order
Citations
PageRank
Xi Yang13717.39
Jianbin Fang226525.31
Jing Chen328560.83
Chengkun Wu4217.79
Tao Tang5427.44
Kai Lu646557.59