Abstract | ||
---|---|---|
The realistic simulation of ultrasound wave propagation is computationally intensive. The large size of the grid and low degree of reuse of data means that it places a great demand on memory bandwidth. Graphics Processing Units (GPUs) have attracted attention for performing scientific calculations due to their potential for efficiently performing large numbers of floating point computations. However, many applications may be limited by memory bandwidth, especially for data sets whose size is larger than that of the GPU platform. This problem is only partially mitigated by applying the standard technique of breaking the grid into regions and overlapping the computation of one region with the host-device memory transfer of another. In this paper, we implement a memory-bound GPU-based ultrasound simulation and evaluate the use of a technique for improving performance by compressing the data into a fixed-point representation that reduces the time required for inter-host-device transfers. We demonstrate a speedup of 1.5 times on a simulation where the data is broken into regions that must be copied back and forth between the CPU and GPU. We develop a model that can be used to determine the amount of temporal blocking required to achieve near optimal performance, without extensive experimentation. This technique may also be applied to GPU-based scientific simulations in other domains such as computational fluid dynamics and electromagnetic wave simulation. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1109/IPDPSW.2014.140 | Parallel & Distributed Processing Symposium Workshops |
Keywords | Field | DocType |
data compression,graphics processing units,ultrasonic propagation,GPU based ultrasound simulation acceleration,GPU platform,data compression,fixed point representation,floating point computations,graphics processing units,host device memory transfer,interhost device transfers,memory bandwidth,scientific calculations,ultrasound wave propagation,Data compression,GPGPU,Memory architecture,Nonlinear acoustics,Parallel architectures | Central processing unit,Memory bandwidth,CUDA,Floating point,Computer science,Parallel computing,General-purpose computing on graphics processing units,Grid,Memory architecture,Speedup | Conference |
Citations | PageRank | References |
0 | 0.34 | 14 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Andrew A. Haigh | 1 | 0 | 0.34 |
Eric McCreath | 2 | 132 | 14.64 |