Title
Parallel scalability in speech recognition
Abstract
We propose four application-level implementation alternatives called algorithm styles and construct highly optimized implementations on two parallel platforms: an Intel Core i7 multicore processor and a NVIDIA GTX280 manycore processor. The highest performing algorithm style varies with the implementation platform. On a 44-min speech data set, we demonstrate substantial speedups of 3.4 X on Core i7 and 10.5 X on GTX280 compared to a highly optimized sequential implementation on Core i7 without sacrificing accuracy. The parallel implementations contain less than 2.5% sequential overhead, promising scalability and significant potential for further speedup on future platforms.
Year
DOI
Venue
2009
10.1109/MSP.2009.934124
Signal Processing Magazine, IEEE
Keywords
DocType
Volume
parallel processing,speech processing,application-level implementation,speech recognition,algorithm styles,intel core i7 multicore processor,nvidia gtx280 manycore processor,parallel scalability,application software,multicore processors,scalability,multicore processing,feature extraction,algorithm design and analysis,space exploration,synchronization,engines
Journal
26
Issue
ISSN
Citations 
6
1053-5888
22
PageRank 
References 
Authors
1.16
11
8
Name
Order
Citations
PageRank
Kisun You1989.64
Jike Chong213611.62
Youngmin Yi328125.93
Ekaterina Gonina4736.50
Christopher Hughes5292.02
Yen-Kuang Chen688895.79
Wonyong Sung71445166.19
Kurt Keutzer85040801.67