Title
The Resource Usage Aware Backfilling
Abstract
Job scheduling policies for HPC centers have been extensively studied in the last few years, especially backfilling-based policies. Almost all of these studies have been done using simulation tools. All existing simulators use the runtime (either estimated or real) provided in the workload as the basis of their simulations. In our previous work, we analyzed the impact on system performance of considering the resource sharing (memory bandwidth) of running jobs by including a new resource model in the Alvio simulator. Based on these studies, we proposed the LessConsume and LessConsume Threshold resource selection policies. Both aim to reduce the saturation of shared resources, thus increasing the performance of the system. The results showed that both resource selection policies can improve system performance by considering where jobs are finally allocated. Using the LessConsume Threshold resource selection policy, we propose a new backfilling strategy: the Resource Usage Aware Backfilling job scheduling policy. This is a backfilling-based scheduling policy in which the algorithms that decide which job has to be executed and how jobs have to be backfilled are based on different Threshold configurations. This backfilling variant considers how the shared resources are used by the scheduled jobs. Rather than backfilling the first job that can be moved to the run queue based on job arrival time or job size, it looks ahead to the next queued jobs and tries to allocate jobs that would experience a lower penalized runtime caused by resource sharing saturation. In the paper, we demonstrate how the exchange of scheduling information between the local resource manager and the scheduler can substantially improve the performance of the system when resource sharing is considered.
We show that it can achieve response time performance close to that of Shortest Job First Backfilling with First Fit (which is oriented to improving the start time of the allocated jobs), while providing a qualitative improvement in the number of killed jobs and in the percentage of penalized runtime.
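The look-ahead idea described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper or from the Alvio simulator: the `Job` fields, the linear penalty model in `penalty_fraction`, and the `threshold` and `lookahead` parameters are all hypothetical. The sketch also leaves out the usual backfilling check that a candidate must not delay the reservation of the job at the head of the queue.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Job:
    name: str
    cpus: int          # processors requested
    runtime: float     # requested runtime in seconds
    bandwidth: float   # hypothetical memory-bandwidth demand


def penalty_fraction(job: Job, used_bandwidth: float, capacity: float) -> float:
    """Estimated extra runtime, as a fraction, if the job is started now.

    Assumed model: no penalty while total demand fits the capacity,
    otherwise runtime stretches in proportion to the oversubscription.
    """
    total = used_bandwidth + job.bandwidth
    if total <= capacity:
        return 0.0
    return total / capacity - 1.0


def pick_backfill_job(queue: List[Job], free_cpus: int, used_bandwidth: float,
                      capacity: float, threshold: float,
                      lookahead: int) -> Optional[Job]:
    """Look ahead over the first `lookahead` queued jobs and return the one
    that fits in the free CPUs with the lowest estimated penalty, provided
    that penalty does not exceed `threshold`. Returns None if none qualifies."""
    best: Optional[Job] = None
    best_penalty = 0.0
    for job in queue[:lookahead]:
        if job.cpus > free_cpus:
            continue  # does not fit in the currently free processors
        penalty = penalty_fraction(job, used_bandwidth, capacity)
        if penalty > threshold:
            continue  # would saturate the shared resource too much
        if best is None or penalty < best_penalty:
            best, best_penalty = job, penalty
    return best


queue = [
    Job("a", cpus=4, runtime=100.0, bandwidth=8.0),
    Job("b", cpus=2, runtime=50.0, bandwidth=1.0),
    Job("c", cpus=16, runtime=10.0, bandwidth=0.5),
]
chosen = pick_backfill_job(queue, free_cpus=4, used_bandwidth=6.0,
                           capacity=10.0, threshold=0.5, lookahead=3)
```

In this toy queue, a plain first-fit backfill would start job `a`, which oversubscribes the assumed bandwidth capacity, whereas the look-ahead selection prefers job `b`, which runs without any estimated penalty.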
Year
2009
DOI
10.1007/978-3-642-04633-9_4
Venue
JSSPP
Keywords
lessconsume threshold resource selection, penalized runtime, job scheduling policy, next queued job, local resource manager, resource usage aware backfilling, new resource model, job arrival time, shared resource, job size, resource sharing, look ahead, resource manager, system performance, job scheduling, memory bandwidth, resource allocation
Field
Resource management, Run queue, Memory bandwidth, Scheduling (computing), Computer science, Workload, Real-time computing, Resource allocation, Job scheduler, Shared resource
DocType
Conference
Volume
5798
ISSN
0302-9743
Citations
5
PageRank
0.44
References
25
Authors
3
Name
Order
Citations
PageRank
Francesc Guim11059.24
Ivan Rodero245237.95
Julita Corbalan317412.00