Abstract | ||
---|---|---|
We develop a fused matrix multiplication kernel that unifies sampled dense-dense matrix multiplication and sparsedense matrix multiplication under a single operation called FusedMM. By using user-defined functions, FusedMM can capture almost all computational patterns needed by popular graph embedding and GNN approaches.FusedMM is an order of magnitude faster than its equivalent kernels in Deep Gr... |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/IPDPS49936.2021.00034 | 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS) |
Keywords | DocType | ISSN |
Distributed processing,Program processors,Memory management,Bandwidth,Load management,Libraries,Graph neural networks | Conference | 1530-2075 |
ISBN | Citations | PageRank |
978-1-6654-4066-0 | 1 | 0.36 |
References | Authors | |
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Rahman, M.K. | 1 | 5 | 5.85 |
Majedul Haque Sujon | 2 | 1 | 0.36 |
Ariful Azad | 3 | 138 | 15.71 |