Title
Classifier Grouping to Enhance Data Locality for a Multi-threaded Object Detection Algorithm
Abstract
Object detection has become an enabling function that lets modern smart embedded devices run intelligent applications and interact with their environment appropriately and promptly. However, the limited computational resources of embedded devices are a barrier to executing computation-intensive object detection algorithms. Multi-threading on embedded multi-core systems offers an opportunity to boost performance, but the memory bottleneck limits performance scalability. Improving the data locality of applications and maximizing data reuse in on-chip caches have therefore become critical design concerns. This paper comprehensively analyzes the memory behavior and data locality of a multi-threaded object detection algorithm and proposes a novel Classifier-Grouping scheme that significantly enhances data reuse in the on-chip caches of embedded multi-core systems. When a multi-threaded object detection algorithm is executed on a cycle-accurate multi-core simulator, the proposed approach achieves up to 62% better performance than the original parallel program.
Year: 2011
DOI: 10.1109/ICPADS.2011.43
Venue: Parallel and Distributed Systems
Keywords: cache storage, embedded systems, knowledge based systems, multi-threading, multiprocessing systems, object detection, classifier grouping, data locality, embedded multicore systems, memory behavior, multithreaded object detection algorithm, on-chip caches, embedded device, multi-core, parallel processing
Field: Multithreading, Bottleneck, Object detection, Locality, Computer science, Knowledge-based systems, Algorithm, Real-time computing, Multi-core processor, Computation, Distributed computing, Scalability
DocType: Conference
ISSN: 1521-9097
ISBN: 978-1-4577-1875-5
Citations: 4
PageRank: 0.44
References: 9
Authors: 3
Name | Order | Citations | PageRank
Bo-Cheng Charles Lai | 1 | 7 | 2.24
Chih-Hsuan Chiang | 2 | 4 | 0.44
Guan-Ru Li | 3 | 7 | 0.83