TY - GEN
T1 - Experiments and analyses of data transfers over wide-area dedicated connections
AU - Rao, Nageswara S.V.
AU - Liu, Qiang
AU - Sen, Satyabrata
AU - Hanley, Jesse
AU - Foster, Ian
AU - Kettimuthu, Rajkumar
AU - Wu, Chase Q.
AU - Yun, Daqing
AU - Towsley, Don
AU - Vardoyan, Gayane
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/9/14
Y1 - 2017/9/14
N2 - Dedicated wide-area network connections are increasingly employed in high-performance computing and big data scenarios. One might expect the performance and dynamics of data transfers over such connections to be easy to analyze due to the lack of competing traffic. However, non-linear transport dynamics and end-system complexities (e.g., multi-core hosts and distributed filesystems) can in fact make analysis surprisingly challenging. We present extensive measurements of memory-to-memory and disk-to-disk file transfers over 10Gbps physical and emulated connections with 0-366ms round trip times (RTTs). For memory-to-memory transfers, profiles of both TCP and UDT throughput as a function of RTT show concave and convex regions; large buffer sizes and more parallel flows lead to wider concave regions, which are highly desirable. TCP and UDT both also display complex throughput dynamics, as indicated by their Poincaré maps and Lyapunov exponents. For disk-to-disk transfers, we determine that high throughput can be achieved via a combination of parallel I/O threads, parallel network threads, and direct I/O mode. Our measurements also show that Lustre filesystems can be mounted over long-haul connections using LNet routers, although challenges remain in jointly optimizing file I/O and transport method parameters to achieve peak throughput.
AB - Dedicated wide-area network connections are increasingly employed in high-performance computing and big data scenarios. One might expect the performance and dynamics of data transfers over such connections to be easy to analyze due to the lack of competing traffic. However, non-linear transport dynamics and end-system complexities (e.g., multi-core hosts and distributed filesystems) can in fact make analysis surprisingly challenging. We present extensive measurements of memory-to-memory and disk-to-disk file transfers over 10Gbps physical and emulated connections with 0-366ms round trip times (RTTs). For memory-to-memory transfers, profiles of both TCP and UDT throughput as a function of RTT show concave and convex regions; large buffer sizes and more parallel flows lead to wider concave regions, which are highly desirable. TCP and UDT both also display complex throughput dynamics, as indicated by their Poincaré maps and Lyapunov exponents. For disk-to-disk transfers, we determine that high throughput can be achieved via a combination of parallel I/O threads, parallel network threads, and direct I/O mode. Our measurements also show that Lustre filesystems can be mounted over long-haul connections using LNet routers, although challenges remain in jointly optimizing file I/O and transport method parameters to achieve peak throughput.
KW - Dedicated connections
KW - Lyapunov exponent
KW - Poincaré map
KW - TCP
KW - Throughput and file I/O profiles
KW - Throughput dynamics
KW - UDT
KW - Wide-area transport
UR - http://www.scopus.com/inward/record.url?scp=85032291142&partnerID=8YFLogxK
U2 - 10.1109/ICCCN.2017.8038432
DO - 10.1109/ICCCN.2017.8038432
M3 - Conference contribution
AN - SCOPUS:85032291142
T3 - 2017 26th International Conference on Computer Communications and Networks, ICCCN 2017
BT - 2017 26th International Conference on Computer Communications and Networks, ICCCN 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 26th International Conference on Computer Communications and Networks, ICCCN 2017
Y2 - 31 July 2017 through 3 August 2017
ER -