Abstract | ||
---|---|---|
We propose four application-level implementation alternatives called algorithm styles and construct highly optimized implementations on two parallel platforms: an Intel Core i7 multicore processor and a NVIDIA GTX280 manycore processor. The highest performing algorithm style varies with the implementation platform. On a 44-min speech data set, we demonstrate substantial speedups of 3.4 X on Core i7 and 10.5 X on GTX280 compared to a highly optimized sequential implementation on Core i7 without sacrificing accuracy. The parallel implementations contain less than 2.5% sequential overhead, promising scalability and significant potential for further speedup on future platforms. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1109/MSP.2009.934124 | Signal Processing Magazine, IEEE |
Keywords | DocType | Volume |
parallel processing,speech processing,application-level implementation,speech recognition,algorithm styles,intel core i7 multicore processor,nvidia gtx280 manycore processor,parallel scalability,application software,multicore processors,scalability,multicore processing,feature extraction,algorithm design and analysis,space exploration,synchronization,engines | Journal | 26 |
Issue | ISSN | Citations |
6 | 1053-5888 | 22 |
PageRank | References | Authors |
1.16 | 11 | 8 |
Name | Order | Citations | PageRank |
---|---|---|---|
Kisun You | 1 | 98 | 9.64 |
Jike Chong | 2 | 136 | 11.62 |
Youngmin Yi | 3 | 281 | 25.93 |
Ekaterina Gonina | 4 | 73 | 6.50 |
Christopher Hughes | 5 | 29 | 2.02 |
Yen-Kuang Chen | 6 | 888 | 95.79 |
Wonyong Sung | 7 | 1445 | 166.19 |
Kurt Keutzer | 8 | 5040 | 801.67 |