Title
MPI-FM: high performance MPI on workstation clusters
Abstract
Despite the emergence of high speed LANs, the communication performance available to applications on workstation clusters still falls short of that available on MPPs. A new generation of efficient messaging layers is needed to take advantage of the hardware performance and to deliver it to the application level. Communication software is the key element in bridging the communication performance gap separating MPPs and workstation clusters. MPI-FM is a high performance implementation of the Message Passing Interface (MPI) for networks of workstations connected with a Myrinet network, built on top of the Fast Messages (FM) library. Based on FM version 1.1, released in Fall 1995, MPI-FM achieves a minimum one-way latency of 19 μs and a peak bandwidth of 17.3 Mbyte/s with common MPI send and receive function calls. A direct comparison using published performance figures shows that MPI-FM running on SPARCstation 20 workstations connected with a relatively inexpensive Myrinet network outperforms the MPI implementations available on the IBM SP2 and the Cray T3D, both in latency and in bandwidth, for messages up to 2 kbyte in size. We describe the critical performance issues found in building a high level messaging library (MPI) on top of a low level messaging layer (FM), and the design solutions we adopted for them. One such issue was the direct and efficient support of common operations such as adding and removing a header. Another was the exchange of critical information between the layers, such as the location of the destination buffer. These two optimizations are both shown to be necessary, and their combination sufficient, to achieve the aforementioned level of performance. The performance contribution of each of these optimizations is examined in some detail.
These results delineate a new design approach for low level communication layers in which a closer integration with the upper layer and an appropriate balance of the communication pipeline stages are the key elements for high performance.
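The two optimizations named in the abstract (header handling without an extra staging copy, and letting the upper layer tell the lower layer where the payload should land) can be illustrated with a toy copy-counting model. This is only a sketch under stated assumptions: it is not the FM or MPI-FM API, and the header layout, `CopyCounter`, `naive_transfer`, `integrated_transfer`, and the `lookup_buffer` callback are all hypothetical names introduced for illustration.

```python
import struct

# Hypothetical 8-byte header (tag, payload length); not the real MPI-FM header.
HEADER = struct.Struct("!II")


class CopyCounter:
    """Counts bytes moved by memory-to-memory copies in this toy model."""

    def __init__(self):
        self.bytes_copied = 0

    def copy(self, dst, dst_off, src):
        dst[dst_off:dst_off + len(src)] = src
        self.bytes_copied += len(src)


def naive_transfer(tag, payload, user_buf, counter):
    """Layers that share no information: staging and temporary buffers."""
    # Sender, upper layer: build a contiguous header+payload staging buffer.
    staging = bytearray(HEADER.size + len(payload))
    counter.copy(staging, 0, HEADER.pack(tag, len(payload)))
    counter.copy(staging, HEADER.size, payload)
    # Sender, lower layer: copy the staged message onto the simulated wire.
    wire = bytearray(len(staging))
    counter.copy(wire, 0, staging)
    # Receiver, lower layer: deliver the whole message into a temporary buffer.
    temp = bytearray(len(wire))
    counter.copy(temp, 0, wire)
    # Receiver, upper layer: strip the header, copy payload to the user buffer.
    rtag, length = HEADER.unpack_from(temp, 0)
    counter.copy(user_buf, 0, temp[HEADER.size:HEADER.size + length])
    return rtag


def integrated_transfer(tag, payload, lookup_buffer, counter):
    """Gather on send; on receive, ask the upper layer for the destination."""
    # Sender: header and payload are gathered straight onto the wire.
    wire = bytearray(HEADER.size + len(payload))
    counter.copy(wire, 0, HEADER.pack(tag, len(payload)))
    counter.copy(wire, HEADER.size, payload)
    # Receiver: the lower layer hands the header to the upper layer, which
    # returns the destination buffer, so the payload is copied exactly once.
    rtag, length = HEADER.unpack_from(wire, 0)
    dest = lookup_buffer(rtag, length)
    counter.copy(dest, 0, wire[HEADER.size:HEADER.size + length])
    return rtag
```

In this model a 1024-byte payload costs 4120 copied bytes on the naive path and 2056 on the integrated path. The numbers say nothing about real hardware; they only show why removing staging copies matters more as messages grow.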
Year
1997
DOI
10.1006/jpdc.1996.1264
Venue
J. Parallel Distrib. Comput.
Keywords
workstation cluster, high performance, message passing interface, receiver function
Field
Data transmission, Computer science, Parallel computing, Bridging (networking), Workstation, Input/output, Message Passing Interface, Bandwidth (signal processing), Header, Operating system, Myrinet, Distributed computing
DocType
Journal
Volume
40
Issue
1
ISSN
Journal of Parallel and Distributed Computing
Citations
67
PageRank
12.27
References
9
Authors
2
Name, Order, Citations, PageRank
Mario Lauria, 1, 628, 95.12
Andrew A. Chien, 2, 3696, 405.97