Title
A Parallel Data Mining Algorithm for PageRank Computation
Abstract
We study the utility of graphics processing units (GPUs) for an acceleration of the data mining PageRank algorithm and a reduction of the memory size of the web graph. We first present a new web graph representation using a compressed format in order to reduce the memory allocation of the web graph. Then, this web graph is simply partitioned into small chunks to be processed on the GPUs' device. The basic steps of the algorithm are then split up into parallel operations allowing to exploit the computing power of GPUs in the CUDA language as best as possible. In the experiments, we have tested the algorithm using GPUs with a set of real web data, and compared the computation with a CPU-based one. The obtained results show that the proposed PageRank computation on GPUs outperforms the CPU version by a factor of 100, reducing at the same time the web graph memory storage by 93, 928%.
Year
DOI
Venue
2016
10.1145/3010089.3010118
Proceedings of the International Conference on Big Data and Advanced Wireless Technologies
DocType
ISBN
Citations 
Conference
978-1-4503-4779-2
0
PageRank 
References 
Authors
0.34
2
5
Name
Order
Citations
PageRank
Massinissa Saoudi181.87
Massinissa Lounis2222.88
Ahcène Bounceur330635.05
Reinhardt Euler483.39
Tahar M. Kechadi5273.24