Title
AIFSP: An Adaptive Instruction Flow Stream Processor
Abstract
Stream processor is efficient for media applications as it exploits the features of media processing, such as data parallelism, producer-consumer locality and so on. However, the loosely coupled structure between host and stream processor makes the communication between scalar and SIMD part costly and scheduling across kernels less flexible. Besides, the kernel loading time adds additional cost. When the stream length becomes shorter the performance degradation caused by these factors is unacceptable. In addition to the loosely coupled structure, lack of efficient support for chained scalar and SIMD kernels makes the case worse. To overcome these shortcomings of stream processor, we propose a target architecture named AIFSP, which merges the host and stream processor together into a tightly coupled structure with both scalar and SIMD part. The whole processor can run in a single or dual instruction flow mode, adaptive to the characteristic of applications. When running in a single instruction flow mode, costless communication between scalar and SIMD part, flexible scheduling across kernels and zero kernel loading time can be achieved, the speedup for short streams can reach 2.6x, while in dual instruction flow mode, the scalar and SIMD kernels can run concurrently on scalar and SIMD part of AIFSP, thus kernel overlapping is realized and about 20% performance improving can be attained when SIMD width is set to 8, with the increase of SIMD width, the performance gain will be larger.
Year
DOI
Venue
2011
10.1109/ISVLSI.2011.62
ISVLSI
Keywords
Field
DocType
zero kernel loading time,processor scheduling,parallel processing,simd width,enhanced scalar processor,stream length,simd part,adaptive instruction flow stream processor,simd kernels,kernel loading time,data parallelism,aifsp,stream length effect,kernel overlapping,dual instruction flow mode,media applications,whole processor,producer-consumer locality,stream processor,simd kernel,short stream,chained scalar,adaptive instruction flow stream,single instruction flow mode,microcontrollers,process control,media,kernel
Kernel (linear algebra),Computer science,Scheduling (computing),Parallel computing,Scalar (physics),SIMD,Data parallelism,Microcontroller,Stream processing,Speedup
Conference
ISSN
ISBN
Citations 
2159-3469 E-ISBN : 978-0-7695-4447-2
978-0-7695-4447-2
0
PageRank 
References 
Authors
0.34
9
5
Name
Order
Citations
PageRank
Yaohua Wang14414.23
Shuming Chen213838.21
Jiang-Hua Wan3155.86
Kai Zhang400.34
Shenggang Chen5215.98