Improving Data Transfer Throughput with Direct Search Optimization

Prasanna Balaprakash, Vitali Morozov, Rajkumar Kettimuthu, Kalyan Kumaran, Ian Foster

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Improving data transfer throughput over high-speed long-distance networks has become increasingly difficult. Numerous factors such as nondeterministic congestion, dynamics of the transfer protocol, and multiuser and multitask source and destination endpoints, as well as interactions among these factors, contribute to this difficulty. A promising approach to improving throughput consists in using parallel streams at the application layer. We formulate and solve the problem of choosing the number of such streams from a mathematical optimization perspective. We propose the use of direct search methods, a class of easy-to-implement and light-weight mathematical optimization algorithms, to improve the performance of data transfers by dynamically adapting the number of parallel streams in a manner that does not require domain expertise, instrumentation, analytical models, or historic data. We apply our method to transfers performed with the GridFTP protocol, and illustrate the effectiveness of the proposed algorithm when used within Globus, a state-of-the-art data transfer tool, on productionWAN links and servers. We show that when compared to user default settings our direct search methods can achieve up to 10x performance improvement under certain conditions. We also show that our method can overcome performance degradation due to external compute and network load on source end points, a common scenario at high performance computing facilities.

Original languageEnglish
Title of host publicationProceedings - 45th International Conference on Parallel Processing, ICPP 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages248-257
Number of pages10
ISBN (Electronic)9781509028238
DOIs
StatePublished - Sep 21 2016
Externally publishedYes
Event45th International Conference on Parallel Processing, ICPP 2016 - Philadelphia, United States
Duration: Aug 16 2016Aug 19 2016

Publication series

NameProceedings of the International Conference on Parallel Processing
Volume2016-September
ISSN (Print)0190-3918

Conference

Conference45th International Conference on Parallel Processing, ICPP 2016
Country/TerritoryUnited States
CityPhiladelphia
Period08/16/1608/19/16

Funding

This work was supported by the U.S. Department of Energy, Office of Science, Advanced Scientific Computing Research program under contract number DE-AC02-06CH11357.

FundersFunder number
National Science Foundation1440761
U.S. Department of Energy
Office of Science
Advanced Scientific Computing ResearchDE-AC02-06CH11357

    Keywords

    • Data transfer
    • Direct search
    • Parallelism
    • Tuning

    Fingerprint

    Dive into the research topics of 'Improving Data Transfer Throughput with Direct Search Optimization'. Together they form a unique fingerprint.

    Cite this