Title
Exposing tunable parameters in multi-threaded numerical code
Abstract
Achieving high performance on today's architectures requires careful orchestration of many optimization parameters. In particular, the presence of shared-caches on multicore architecturesmakes it necessary to consider, in concert, issues related to both parallelism and data locality. This paper presents a systematic and extensive exploration of thecombined search space of transformation parameters that affect both parallelism and data locality inmulti-threaded numerical applications.We characterize the nature of the complex interaction between blocking, problem decomposition and selection of loops for parallelism. We identify key parameters for tuning and provide an automatic mechanism for exposing these parameters to a search tool. A series of experiments on two scientific benchmarks illustrates the non-orthogonality of the transformation search space and reiterates the need for integrated transformation heuristics for achieving high-performance on current multicore architectures.
Year
DOI
Venue
2010
10.1007/978-3-642-15672-4_6
NPC
Keywords
Field
DocType
data locality,search tool,multi-threaded numerical code,multicore architecturesmakes,automatic mechanism,current multicore architecture,transformation search space,careful orchestration,tunable parameter,transformation parameter,thecombined search space,integrated transformation heuristics,search space,integral transforms
Instruction-level parallelism,Locality,Memory hierarchy,Task parallelism,Computer science,Parallel computing,Data parallelism,Heuristics,Orchestration (computing),Multi-core processor,Distributed computing
Conference
Volume
ISSN
ISBN
6289
0302-9743
3-642-15671-1
Citations 
PageRank 
References 
2
0.38
17
Authors
4
Name
Order
Citations
PageRank
Apan Qasem112716.34
Jichi Guo2443.31
Faizur Rahman320.38
Qing Yi419011.89