Title
A runtime resolution scheme for priority boost conflict in implicit coscheduling
Abstract
High-performance parallel and scientific applications are composed of multiple processes running on distinct CPUs that communicate frequently. Due to the synchronization needs of such applications, performance is greatly hampered if their processes are not scheduled simultaneously on the CPUs. Implicit coscheduling (ICS) is a well-known technique to address this problem in multi-programmed clusters, however, traditional ICS schemes do not incorporate steps to adequately deal with priority boost conflicts, leading to significantly degraded performance. In this paper, we propose the use of runtime difference in contention across nodes to provide more sophisticated coscheduling decisions in response to the conflicts. We also present a novel coscheduling scheme termed PROC (Process ReOrdering-based Coscheduling) that adaptively regulates the scheduling sequence of conflicting processes based on the rescheduling latency of their correspondents in remote nodes. We perform extensive simulation-based experiments using both synthetic and realistic workloads to analyze the performance of PROC compared to alternatives such as local scheduling, a widely used batch scheduling, gang scheduling, and existing ICS schemes. The results show that all ICS schemes commonly experience priority boost conflicts, and that the proposed PROC significantly outperforms other ICS alternatives (or batch scheduling) by up to 50.4% (or 72.5%) in the average job response time. This improvement is achieved by reducing wasted idle time and spinning time without sacrificing fairness.
Year
DOI
Venue
2007
10.1007/s11227-006-0006-3
The Journal of Supercomputing
Keywords
Field
DocType
Clusters,Coscheduling,Priority boost conflict,Runtime contention,Rescheduling latency,Process reordering,Performance comparison
Synchronization,Supercomputer,Computer science,Parallel algorithm,Scheduling (computing),Coscheduling,Parallel computing,Gang scheduling,Job scheduler,Computer cluster,Distributed computing
Journal
Volume
Issue
ISSN
40
1
0920-8542
Citations 
PageRank 
References 
2
0.39
28
Authors
3
Name
Order
Citations
PageRank
Junglok Yu1416.09
Jin-Soo Kim21806122.94
Seung Ryoul Maeng38519.68