Abstract |
---|
Low-latency collective communications are key to application scalability. As systems grow larger, minimizing collective communication time becomes increasingly challenging. Offload is an effective technique for accelerating collective operations; however, algorithms for collective communication constantly evolve, so flexible implementations are critical. This paper presents triggered operations, a semantic building block that allows the key components of collective communications to be offloaded while still letting host-side software define the algorithm. Simulations demonstrate the performance improvements achievable by offloading MPI_Allreduce using these building blocks. |
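
To make the idea in the abstract concrete, the following is a minimal software sketch, not the paper's implementation or any real network-interface API. It assumes a triggered operation is an operation that is set up in advance and fires on its own once a counter reaches a threshold. Every name here (`Counter`, `Node`, `atomic_add`, `put_result`, `setup_allreduce`) is hypothetical, and the binary-tree algorithm is only one possible choice the host might make.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class Counter:
    """Hypothetical counting event that fires operations at thresholds."""
    value: int = 0
    pending: List[Tuple[int, Callable[[], None]]] = field(default_factory=list)

    def trigger_at(self, threshold: int, op: Callable[[], None]) -> None:
        # Register an operation to run once the counter reaches `threshold`.
        self.pending.append((threshold, op))
        self._fire()

    def increment(self) -> None:
        self.value += 1
        self._fire()

    def _fire(self) -> None:
        ready = [op for t, op in self.pending if t <= self.value]
        self.pending = [(t, op) for t, op in self.pending if t > self.value]
        for op in ready:
            op()


class Node:
    """One simulated process with a reduction buffer and two counters."""
    def __init__(self, rank: int) -> None:
        self.rank = rank
        self.acc = 0            # partial sum accumulated at this node
        self.result = None      # final allreduce result
        self.up = Counter()     # counts contributions added into acc
        self.down = Counter()   # counts arrival of the final result


def atomic_add(target: Node, value: int) -> None:
    target.acc += value
    target.up.increment()


def put_result(target: Node, value: int) -> None:
    target.result = value
    target.down.increment()


def setup_allreduce(n: int) -> List[Node]:
    """Host side: define the algorithm (a binary tree) as triggered operations."""
    nodes = [Node(r) for r in range(n)]
    for node in nodes:
        kids = [nodes[c] for c in (2 * node.rank + 1, 2 * node.rank + 2) if c < n]
        expected = len(kids) + 1  # own contribution plus one per child
        if node.rank == 0:
            # Root: the sum is complete once all contributions have arrived.
            node.up.trigger_at(expected, lambda node=node: put_result(node, node.acc))
        else:
            parent = nodes[(node.rank - 1) // 2]
            node.up.trigger_at(
                expected, lambda node=node, parent=parent: atomic_add(parent, node.acc)
            )
        for kid in kids:
            # Forward the final result down the tree as soon as it arrives.
            node.down.trigger_at(1, lambda node=node, kid=kid: put_result(kid, node.result))
    return nodes


def allreduce(values: List[int]) -> List[int]:
    nodes = setup_allreduce(len(values))
    for node, v in zip(nodes, values):
        atomic_add(node, v)  # the only action taken after setup
    return [node.result for node in nodes]
```

In this sketch, `setup_allreduce` plays the role of the host software that chooses the algorithm, while the counters stand in for the offloaded part that carries it out without further host involvement. Changing the tree shape only changes the setup code, which is the flexibility the abstract argues for.
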
Year | DOI | Venue |
---|---|---|
2011 | 10.1109/HOTI.2011.15 | Hot Interconnects |
Keywords | Field | DocType |
semantic building block,application scalability,building block,effective technique,collective operation,enabling flexible collective communication,collective communication time,flexible implementation,collective communication,triggered operations,low latency collective communication,key component,network interface,hardware,radiation detectors,radiation detector,message passing,noise,semantics,network interfaces,mpi,computer network | Computer science,Parallel computing,Collective communication,Computer network,Implementation,Software,Latency (engineering),Message passing,Semantics,Scalability,Network interface,Distributed computing | Conference |
ISBN | Citations | PageRank |
978-0-7695-4537-0 | 10 | 0.66 |
References | Authors | |
10 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Keith D. Underwood | 1 | 847 | 77.39 |
Jerrie Coffman | 2 | 10 | 0.66 |
Roy Larsen | 3 | 10 | 0.66 |
K. Scott Hemmert | 4 | 577 | 50.62 |
Brian W. Barrett | 5 | 130 | 9.27 |
Ron Brightwell | 6 | 1060 | 94.72 |
Michael Levenhagen | 7 | 26 | 2.74 |