TY - GEN
T1 - Understanding and Leveraging the I/O Patterns of Emerging Machine Learning Analytics
AU - Gainaru, Ana
AU - Ganyushin, Dmitry
AU - Xie, Bing
AU - Kurc, Tahsin
AU - Saltz, Joel
AU - Oral, Sarp
AU - Podhorszki, Norbert
AU - Poeschel, Franz
AU - Huebl, Axel
AU - Klasky, Scott
N1 - Publisher Copyright:
© 2022, Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - The scientific community is currently experiencing unprecedented amounts of data generated by cutting-edge science facilities. Soon facilities will be producing up to 1 PB/s which will force scientist to use more autonomous techniques to learn from the data. The adoption of machine learning methods, like deep learning techniques, in large-scale workflows comes with a shift in the workflow’s computational and I/O patterns. These changes often include iterative processes and model architecture searches, in which datasets are analyzed multiple times in different formats with different model configurations in order to find accurate, reliable and efficient learning models. This shift in behavior brings changes in I/O patterns at the application level as well at the system level. These changes also bring new challenges for the HPC I/O teams, since these patterns contain more complex I/O workloads. In this paper we discuss the I/O patterns experienced by emerging analytical codes that rely on machine learning algorithms and highlight the challenges in designing efficient I/O transfers for such workflows. We comment on how to leverage the data access patterns in order to fetch in a more efficient way the required input data in the format and order given by the needs of the application and how to optimize the data path between collaborative processes. We will motivate our work and show performance gains with a study case of medical applications.
AB - The scientific community is currently experiencing unprecedented amounts of data generated by cutting-edge science facilities. Soon facilities will be producing up to 1 PB/s which will force scientist to use more autonomous techniques to learn from the data. The adoption of machine learning methods, like deep learning techniques, in large-scale workflows comes with a shift in the workflow’s computational and I/O patterns. These changes often include iterative processes and model architecture searches, in which datasets are analyzed multiple times in different formats with different model configurations in order to find accurate, reliable and efficient learning models. This shift in behavior brings changes in I/O patterns at the application level as well at the system level. These changes also bring new challenges for the HPC I/O teams, since these patterns contain more complex I/O workloads. In this paper we discuss the I/O patterns experienced by emerging analytical codes that rely on machine learning algorithms and highlight the challenges in designing efficient I/O transfers for such workflows. We comment on how to leverage the data access patterns in order to fetch in a more efficient way the required input data in the format and order given by the needs of the application and how to optimize the data path between collaborative processes. We will motivate our work and show performance gains with a study case of medical applications.
KW - Data management
KW - Deep learning methods
KW - Emerging HPC applications
KW - I/O optimization
KW - I/O patterns
UR - http://www.scopus.com/inward/record.url?scp=85127040388&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-96498-6_7
DO - 10.1007/978-3-030-96498-6_7
M3 - Conference contribution
AN - SCOPUS:85127040388
SN - 9783030964979
T3 - Communications in Computer and Information Science
SP - 119
EP - 138
BT - Driving Scientific and Engineering Discoveries Through the Integration of Experiment, Big Data, and Modeling and Simulation - 21st Smoky Mountains Computational Sciences and Engineering, SMC 2021, Revised Selected Papers
A2 - Nichols, [given-name]Jeffrey
A2 - Maccabe, [given-name]Arthur ‘Barney’
A2 - Nutaro, James
A2 - Pophale, Swaroop
A2 - Devineni, Pravallika
A2 - Ahearn, Theresa
A2 - Verastegui, Becky
PB - Springer Science and Business Media Deutschland GmbH
T2 - 21st Smoky Mountains Computational Sciences and Engineering Conference, SMC 2021
Y2 - 18 October 2021 through 20 October 2021
ER -