TY - GEN
T1 - Machine Learning Methods for Connection RTT and Loss Rate Estimation Using MPI Measurements Under Random Losses
AU - Rao, Nageswara S.V.
AU - Imam, Neena
AU - Liu, Zhengchun
AU - Kettimuthu, Rajkumar
AU - Foster, Ian
N1 - Publisher Copyright:
© 2020, IFIP International Federation for Information Processing.
PY - 2020
Y1 - 2020
N2 - Scientific computations are expected to be increasingly distributed across wide-area networks, and Message Passing Interface (MPI) has been shown to scale to support their communications over long distances. Application-level measurements of MPI operations reflect the connection Round-Trip Time (RTT) and loss rate, and machine learning methods have been previously developed to estimate them under deterministic periodic losses. In this paper, we consider more complex, random losses with uniform, Poisson and Gaussian distributions. We study five disparate machine leaning methods, with linear and non-linear, and smooth and non-smooth properties, to estimate RTT and loss rate over 10 Gbps connections with 0–366 ms RTT. The diversity and complexity of these estimators combined with the randomness of losses and TCP’s non-linear response together rule out the selection of a single best among them; instead, we fuse them to retain their design diversity. Overall, the results show that accurate estimates can be generated at low loss rates but become inaccurate at loss rates 10% and higher, thereby illustrating both their strengths and limitations.
AB - Scientific computations are expected to be increasingly distributed across wide-area networks, and Message Passing Interface (MPI) has been shown to scale to support their communications over long distances. Application-level measurements of MPI operations reflect the connection Round-Trip Time (RTT) and loss rate, and machine learning methods have been previously developed to estimate them under deterministic periodic losses. In this paper, we consider more complex, random losses with uniform, Poisson and Gaussian distributions. We study five disparate machine leaning methods, with linear and non-linear, and smooth and non-smooth properties, to estimate RTT and loss rate over 10 Gbps connections with 0–366 ms RTT. The diversity and complexity of these estimators combined with the randomness of losses and TCP’s non-linear response together rule out the selection of a single best among them; instead, we fuse them to retain their design diversity. Overall, the results show that accurate estimates can be generated at low loss rates but become inaccurate at loss rates 10% and higher, thereby illustrating both their strengths and limitations.
KW - Generalization bounds
KW - Information fusion
KW - Loss rate
KW - Machine Learning
KW - Message Passing Interface
KW - Regression
KW - Round Trip Time
UR - http://www.scopus.com/inward/record.url?scp=85084172971&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-45778-5_11
DO - 10.1007/978-3-030-45778-5_11
M3 - Conference contribution
AN - SCOPUS:85084172971
SN - 9783030457778
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 154
EP - 174
BT - Machine Learning for Networking - 2nd IFIP TC 6 International Conference, MLN 2019, Revised Selected Papers
A2 - Boumerdassi, Selma
A2 - Renault, Éric
A2 - Mühlethaler, Paul
PB - Springer
T2 - 2nd International Conference on Machine Learning for Networking, MLN 2019
Y2 - 3 December 2019 through 5 December 2019
ER -