Abstract
Dedicated wide-area network connections are increasingly employed in high-performance computing and big data scenarios. One might expect the performance and dynamics of data transfers over such connections to be easy to analyze due to the lack of competing traffic. However, non-linear transport dynamics and end-system complexities (e.g., multi-core hosts and distributed filesystems) can in fact make analysis surprisingly challenging. We present extensive measurements of memory-to-memory and disk-to-disk file transfers over 10Gbps physical and emulated connections with 0-366ms round trip times (RTTs). For memory-to-memory transfers, profiles of both TCP and UDT throughput as a function of RTT show concave and convex regions; large buffer sizes and more parallel flows lead to wider concave regions, which are highly desirable. TCP and UDT both also display complex throughput dynamics, as indicated by their Poincaré maps and Lyapunov exponents. For disk-to-disk transfers, we determine that high throughput can be achieved via a combination of parallel I/O threads, parallel network threads, and direct I/O mode. Our measurements also show that Lustre filesystems can be mounted over long-haul connections using LNet routers, although challenges remain in jointly optimizing file I/O and transport method parameters to achieve peak throughput.
Original language | English |
---|---|
Title of host publication | 2017 26th International Conference on Computer Communications and Networks, ICCCN 2017 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781509029914 |
DOIs | |
State | Published - Sep 14 2017 |
Event | 26th International Conference on Computer Communications and Networks, ICCCN 2017 - Vancouver, Canada Duration: Jul 31 2017 → Aug 3 2017 |
Publication series
Name | 2017 26th International Conference on Computer Communications and Networks, ICCCN 2017 |
---|
Conference
Conference | 26th International Conference on Computer Communications and Networks, ICCCN 2017 |
---|---|
Country/Territory | Canada |
City | Vancouver |
Period | 07/31/17 → 08/3/17 |
Funding
This work is supported in part by the RAMSES and Net2013 projects, Office of Advanced Computing Research, U.S. Department of Energy, and by the Extreme Scale Systems Center, sponsored by U. S. Department of Defense, and performed at Oak Ridge National Laboratory managed by UT-Battelle, LLC for U.S. Department of Energy under Contract DE-AC05-00OR22725 and at Argonne National Laboratory under Contract DE-AC02-06CH11357.
Keywords
- Dedicated connections
- Lyapunov exponent
- Poincaré map
- TCP
- Throughput and file I/O profiles
- Throughput dynamics
- UDT
- Wide-area transport