Title
High Performance Relay Mechanism for MPI Communication Libraries Run on Multiple Private IP Address Clusters
Abstract
We have been developing a Grid-enabled MPI communication library called GridMPI, which is designed to run on multiple clusters connected to a wide-area network. Some of these clusters may use private IP addresses. Therefore, some mechanism to enable communication between private IP address clusters is required. Such a mechanism should be widely adoptable, and should provide high communication performance. In this paper, we propose a message relay mechanism to support private IP address clusters in the manner of the Interoperable MPI (IMPI) standard. Therefore, any MPI implementations which follow the IMPI standard can communicate with the relay. Furthermore, we also propose a trunking method in which multiple pairs of relay nodes simultaneously communicate between clusters to improve the available communication bandwidth. While the relay mechanism introduces an one-way latency of about 25 musec, the extra overhead is negligible, since the communication latency through a wide area network is a few hundred times as large as this. By using trunking, the inter-cluster communication bandwidth can improve as the number of trunks increases. We confirmed the effectiveness of the proposed method by experiments using a 10 Gbps emulated WAN environment. When relay nodes with 1 Gbps NICs are used, the performance of most of the NAS Parallel Benchmarks improved proportional to the number of trunks. Especially, using 8 trunks, FT and IS are 4.4 and 3.4 times faster, respectively, compared with the single trunk case. The results showed that the proposed method is effective for running MPI programs over high bandwidth-delay product networks.
Year
DOI
Venue
2008
10.1109/CCGRID.2008.88
CCGrid
Keywords
Field
DocType
workflow broker,conflicting schedule,mpi communication libraries run,multiple private ip address,high performance relay mechanism,current approach,interoperability,open systems,bandwidth,mpi,grid computing,network interfaces,network address translation,computer networks,message passing
Grid computing,Computer science,Latency (engineering),Interoperability,Computer network,Real-time computing,Wide area network,Message passing,Trunking,Relay,Distributed computing,Private network
Conference
ISSN
ISBN
Citations 
2376-4414
978-0-7695-3156-4
0
PageRank 
References 
Authors
0.34
10
7
Name
Order
Citations
PageRank
Ryousei Takano15116.12
Motohiko Matsuda2429.47
Tomohiro Kudoh334450.92
Yuetsu Kodama434749.44
Fumihiro Okazaki541.88
Yutaka Ishikawa61449188.06
Yasufumi Yoshizawa700.68