Abstract
Improving data transfer throughput over high-speed long-distance networks has become increasingly difficult. Numerous factors such as nondeterministic congestion, dynamics of the transfer protocol, and multiuser and multitask source and destination endpoints, as well as interactions among these factors, contribute to this difficulty. A promising approach to improving throughput is to use parallel streams at the application layer. We formulate and solve the problem of choosing the number of such streams from a mathematical optimization perspective. We propose the use of direct search methods, a class of easy-to-implement and lightweight mathematical optimization algorithms, to improve the performance of data transfers by dynamically adapting the number of parallel streams in a manner that does not require domain expertise, instrumentation, analytical models, or historic data. We apply our method to transfers performed with the GridFTP protocol and illustrate the effectiveness of the proposed algorithm when used within Globus, a state-of-the-art data transfer tool, on production WAN links and servers. We show that, compared with user default settings, our direct search methods can achieve up to a 10x performance improvement under certain conditions. We also show that our method can overcome performance degradation due to external compute and network load on source endpoints, a common scenario at high-performance computing facilities.
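To make the idea of a direct search over the number of parallel streams concrete, the following is a minimal sketch of one common direct search variant (a one-dimensional integer compass search). It is not the algorithm from the paper, whose details the abstract does not give. The names `compass_search`, `measure`, and `toy_throughput`, the bounds, and the initial step size are all hypothetical, and the sketch assumes each throughput measurement is repeatable, which real transfers under changing load are not.

```python
from typing import Callable


def compass_search(
    measure: Callable[[int], float],
    start: int = 2,
    lo: int = 1,
    hi: int = 64,
    step: int = 8,
) -> int:
    """Maximize measure(n) over integers lo <= n <= hi using only
    function values (no derivatives, no model of the network).

    `measure` is a hypothetical callback that runs part of a transfer
    with n parallel streams and returns the observed throughput.
    """
    best = min(max(start, lo), hi)  # clamp the starting point into range
    best_value = measure(best)
    while step >= 1:
        improved = False
        # Poll the two neighbors at the current step size.
        for candidate in (best - step, best + step):
            if lo <= candidate <= hi:
                value = measure(candidate)
                if value > best_value:
                    best, best_value = candidate, value
                    improved = True
                    break
        if not improved:
            step //= 2  # no neighbor is better: refine the step
    return best


def toy_throughput(n: int) -> float:
    """Made-up stand-in for a measurement, peaking at 12 streams."""
    return -float((n - 12) ** 2)


if __name__ == "__main__":
    print(compass_search(toy_throughput))
```

In this sketch the search uses only observed values of `measure`, which reflects the abstract's point that no analytical model or historic data is needed. A version for live transfers would have to re-measure the current setting periodically, since the best stream count can shift as external load changes.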
Original language | English |
---|---|
Title of host publication | Proceedings - 45th International Conference on Parallel Processing, ICPP 2016 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 248-257 |
Number of pages | 10 |
ISBN (Electronic) | 9781509028238 |
DOIs | |
State | Published - Sep 21 2016 |
Externally published | Yes |
Event | 45th International Conference on Parallel Processing, ICPP 2016 - Philadelphia, United States Duration: Aug 16 2016 → Aug 19 2016 |
Publication series
Name | Proceedings of the International Conference on Parallel Processing |
---|---|
Volume | 2016-September |
ISSN (Print) | 0190-3918 |
Conference
Conference | 45th International Conference on Parallel Processing, ICPP 2016 |
---|---|
Country/Territory | United States |
City | Philadelphia |
Period | 08/16/16 → 08/19/16 |
Funding
This work was supported by the U.S. Department of Energy, Office of Science, Advanced Scientific Computing Research program under contract number DE-AC02-06CH11357.
Keywords
- Data transfer
- Direct search
- Parallelism
- Tuning