TY - GEN
T1 - Experimental analysis of InfiniBand transport services on WAN
AU - Yu, Weikuan
AU - Rao, Nageswara
AU - Vetter, Jeffrey
PY - 2008
Y1 - 2008
N2 - InfiniBand Architecture (IBA) has emerged as a standard system-area interconnect in industry for both data-center and high performance computing. While IBA continues to evolve with more capabilities, it has yet to permeate the field of grid and pervasive computing across wide area. This requires a software infrastructure to take the full advantage of IBA networking mechanisms without being dwarfed by the long distances. In this paper, we present a detailed analysis of InfiniBand transport services for their suitability to data transfer and message passing in the wide area. Three of the common InfiniBand transport services have been examined, including Reliable Connection (RC), Unreliable Connection (UC), and Unreliable Datagram (UD). Our analysis indicates that UC and UD are better suited to provide high bandwidth for MPI on wide-area networks (WAN). We have also demonstrated that the current existing MPI implementations, designed over RC and UD, can be tuned to provide improved MPI bandwidth on WAN. Furthermore, by developing a UC-based MPI implementation, we have shown that, at a distance of 8600 miles, MPI over UC can improve MPI bandwidth by as much as 100%.
AB - InfiniBand Architecture (IBA) has emerged as a standard system-area interconnect in industry for both data-center and high performance computing. While IBA continues to evolve with more capabilities, it has yet to permeate the field of grid and pervasive computing across wide area. This requires a software infrastructure to take the full advantage of IBA networking mechanisms without being dwarfed by the long distances. In this paper, we present a detailed analysis of InfiniBand transport services for their suitability to data transfer and message passing in the wide area. Three of the common InfiniBand transport services have been examined, including Reliable Connection (RC), Unreliable Connection (UC), and Unreliable Datagram (UD). Our analysis indicates that UC and UD are better suited to provide high bandwidth for MPI on wide-area networks (WAN). We have also demonstrated that the current existing MPI implementations, designed over RC and UD, can be tuned to provide improved MPI bandwidth on WAN. Furthermore, by developing a UC-based MPI implementation, we have shown that, at a distance of 8600 miles, MPI over UC can improve MPI bandwidth by as much as 100%.
UR - http://www.scopus.com/inward/record.url?scp=51849105239&partnerID=8YFLogxK
U2 - 10.1109/NAS.2008.65
DO - 10.1109/NAS.2008.65
M3 - Conference contribution
AN - SCOPUS:51849105239
SN - 9780769531878
T3 - Proceedings of the 2008 IEEE International Conference on Networking, Architecture, and Storage - IEEE NAS 2008
SP - 233
EP - 240
BT - Proceedings of the 2008 IEEE International Conference on Networking, Architecture, and Storage - IEEE NAS 2008
T2 - 2008 IEEE International Conference on Networking, Architecture, and Storage - IEEE NAS 2008
Y2 - 12 June 2008 through 14 June 2008
ER -