Abstract | ||
---|---|---|
Computer generated hologram (CGH) is a promising technology for realizing 3D displays. Large-scale CGH has an advantage that it resolves problems of existing 3D displays. However, the large-scale CGH generation requires a lot of memory space and computation time in proportion to pixel number. Further, in order to use CGH as a display, it needs to be generated in real time, and this is the reason why CGH does not suit to practical use. Computation of CGH is comprised of data-independent operations and current GPU has thousands of processing cores. Thus, acceleration of CGH generation can be expected by using GPU. To accelerate CGH generation processing, we adapt several parallelization and optimization techniques to the CGH program both for single node and multiple ones. The single node optimization techniques include the way of object decomposition, the reduction of data transfer amount between CPU and GPU, the kernel integration, stream processing, and the utilization of multi-GPU parallelism. The multi-node optimization includes inter-node data distribution method. The results show that we have achieved 134.7 times speed-up compared to sequential program execution by CPU. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1109/CANDAR.2017.53 | 2017 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR) |
Keywords | Field | DocType |
CGH, multi-GPU, cluster, object decomposition method, optimization | Iterative reconstruction,Kernel (linear algebra),GPU cluster,Data transmission,Computer science,Computational science,Acceleration,Pixel,Stream processing,Computation | Conference |
ISSN | Citations | PageRank |
2379-1888 | 1 | 0.48 |
References | Authors | |
0 | 8 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shinpei Watanabe | 1 | 1 | 1.50 |
Boaz Jessie Jackin | 2 | 1 | 1.83 |
Takeshi Ohkawa | 3 | 21 | 16.24 |
Kanemitsu Ootsu | 4 | 44 | 23.90 |
Takashi Yokota | 5 | 41 | 21.70 |
Yoshio Hayasaki | 6 | 21 | 4.63 |
Toyohiko Yatagai | 7 | 2 | 1.52 |
Takanobu Baba | 8 | 71 | 27.53 |