Abstract | ||
---|---|---|
Widespread use of wavelet transforms as in JPEG2000 demands efficient implementations on general purpose computers as well as dedicated hardware. The increasing availability of SIMD technologies is a great challenge since efficient SIMD parallelizations are not trivial. This work presents a parallelized 2-D wavelet transform following a single-loop approach, i.e. a loop fusion of the lifting steps of horizontal filtering, and interleaving horizontal and vertical filtering for optimal temporal locality. In this way, each input value is read only once and each output value is written once without subsequent updates. Such an approach turns out to be a necessary basis for an efficient SIMD parallelization. Results are obtained on a general purpose processor with a 4-fold single-precision SIMD extension. Speedups of about 3.7 due to the use of SIMD, 2.55 due to the single-loop approach and up to 6 due to cache effects for pathologic data sizes are obtained, giving total speedups of up to 56. |
Year | DOI | Venue |
---|---|---|
2006 | 10.1109/PDP.2006.14 | parallel, distributed and network-based processing |
Keywords | Field | DocType |
general purpose processor,general purpose computer,2-D wavelet,Single-Loop Approach,SIMD technology,4-fold single-precision SIMD extension,efficient SIMD parallelizations,2-D Wavelet Lifting,single-loop approach,efficient implementation,SIMD Parallelization,efficient SIMD parallelization,Widespread use | Loop fusion,Locality of reference,Lifting scheme,Computer science,Parallel computing,SIMD,Second-generation wavelet transform,Discrete wavelet transform,Stationary wavelet transform,Wavelet packet decomposition | Conference |
ISSN | ISBN | Citations |
1066-6192 | 0-7695-2513-X | 18 |
PageRank | References | Authors |
0.99 | 9 | 1 |
Name | Order | Citations | PageRank |
---|---|---|---|
Rade Kutil | 1 | 61 | 8.80 |