Title
A multithreaded multicore system for embedded media processing
Abstract
We describe a multicore system targeting media processing applications where the cores are multithreaded. The multithreaded cores use a new type of multithreading that we call Subset Static Interleaved (SSI) multithreading. SSI multithreading combines the advantages of blocked multithreading and a simple form of interleaved multithreading called static interleaved multithreading. SSI multithreading divides threads into foreground and background threads and performs static interleaving among the foreground threads. A foreground thread is swapped with a runnable background thread whenever the foreground thread is stalled. SSI multithreading achieves reduced operation latencies, memory latency tolerance, fast context switching, and compared to traditional dynamic interleaving, a relatively low design complexity of the register file. We use a task scheduling unit (TSU) to dispatch tasks to the cores. The TSU is aware of the fact that the cores are multithreaded. This makes a more efficient mapping of tasks to cores possible by scheduling tasks on the least loaded cores. We evaluate the system on an optimized Super HD H.264 decoder where the macroblock decoding and deblocking has been parallelized. The complexity of the H.264 standard and the high resolution makes this a challenging and performance demanding application. We achieve speedups of up to 17.7 times for 16 cores with four threads per core relative to a single-threaded single core. Furthermore, the proposed SSI multithreading achieves a speedup of 1.52 times relative to no multithreading, while blocked multithreading achieves only 1.38 times and a restricted form of interleaved multithreading achieves only 1.37 times speedup.
Year
DOI
Venue
2011
10.1007/978-3-642-19448-1_9
T. HiPEAC
Keywords
DocType
Volume
background thread,times speedup,static interleaved multithreading,multithreaded multicore system,interleaved multithreading,h.264 decoder,ssi multithreading,embedded media processing,multithreaded core,runnable background thread,foreground thread,proposed ssi multithreading,register file,high resolution,memory latency
Journal
3
Citations 
PageRank 
References 
16
0.87
18
Authors
2
Name
Order
Citations
PageRank
Jan Hoogerbrugge126122.61
Andrei Terechko21338.64