Title | ||
---|---|---|
Characterizing the Influence of System Noise on Large-Scale Applications by Simulation |
Abstract | ||
---|---|---|
This paper presents an in-depth analysis of the impact of system noise on large-scale parallel application performance in realistic settings. Our analytical model shows that not only collective operations but also point-to-point communications influence the application's sensitivity to noise. We present a simulation toolchain that injects noise delays from traces gathered on common large-scale architectures into a LogGPS simulation and allows new insights into the scaling of applications in noisy environments. We investigate collective operations with up to 1 million processes and three applications (Sweep3D, AMG, and POP) with up to 32,000 processes.We show that the scale at which noise becomes a bottleneck is system-specific and depends on the structure of the noise. Simulations with different network speeds show that a 10x faster network does not improve application scalability. We quantify noise and conclude that our tools can be utilized to tune the noise signatures of a specific system. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1109/SC.2010.12 | SC |
Keywords | Field | DocType |
common large-scale architecture,system noise,different network speed,application scalability,injects noise delay,large-scale applications,faster network,collective operation,large-scale parallel application performance,noise signature,loggps simulation,point to point,synchronization,noise,point to point communications,noise measurement,benchmark testing | Bottleneck,Synchronization,Noise measurement,Computer science,Parallel computing,Real-time computing,Point-to-point,Scaling,Toolchain,Benchmark (computing),Scalability | Conference |
Citations | PageRank | References |
93 | 3.57 | 14 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Torsten Hoefler | 1 | 2197 | 163.64 |
Timo Schneider | 2 | 312 | 18.39 |
Andrew Lumsdaine | 3 | 2754 | 236.74 |