Abstract | ||
---|---|---|
Buffered CoScheduled (BCS) MPI is a novel implementation of MPI based on global synchronization of all system activities. BCS-MPI imposes a model where all processes and their communication are tightly scheduled at a very fine granularity. Thus, BCS-MPI provides a system that is much more controllable and deterministic. BCS-MPI leverages this regular behavior to provide a simple yet powerful monitoring and debugging subsystem that streamlines the analysis of parallel software. This subsystem, called Monitoring and Debugging System (MDS), provides exhaustive process and communication scheduling statistics. This paper covers in detail the design and implementation of the MDS subsystem, and demonstrates how the MDS can be used to monitor and debug not only parallel MPI applications but also the BCS-MPI runtime system itself. Additionally, we show that this functionality need not come at a significant performance loss. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1109/IPDPS.2005.295 | IPDPS |
Keywords | Field | DocType |
parallel software,communication scheduling statistic,mds subsystem,parallel mpi application,system activity,novel implementation,exhaustive process,debugging subsystem,large-scale clusters,debugging system,bcs-mpi runtime system,message passing,control systems,scheduling,synchronisation,concurrent computing,clustering algorithms,system monitoring,parallel programming,synchronization,message passing interface,application software | Synchronization,Computer science,Scheduling (computing),Parallel computing,System monitoring,Background debug mode interface,Message passing,Debugging,Distributed computing,Algorithmic program debugging,Runtime system | Conference |
ISBN | Citations | PageRank |
0-7695-2312-9 | 0 | 0.34 |
References | Authors | |
10 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Juan Fernandez | 1 | 269 | 23.17 |
Fabrizio Petrini | 2 | 2050 | 165.82 |
Eitan Frachtenberg | 3 | 1060 | 85.08 |