Abstract
As part of a larger effort, this work-in-progress reports the possible advantages of modifying conventional workflows used to generate labelled training samples and train machine learning (ML) models on them. We compare results from three different workflows using neutron scattering data analysis as the motivating application and report about 20% improvement in speedup, with no appreciable loss of model accuracy, over a baseline workflow.
Original language | English |
---|---|
Title of host publication | 2023 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2023 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 795-798 |
Number of pages | 4 |
ISBN (Electronic) | 9798350311990 |
DOIs | |
State | Published - 2023 |
Event | 2023 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2023 - St. Petersburg, United States Duration: May 15 2023 → May 19 2023 |
Publication series
Name | 2023 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2023 |
---|
Conference
Conference | 2023 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2023 |
---|---|
Country/Territory | United States |
City | St. Petersburg |
Period | 05/15/23 → 05/19/23 |
Funding
This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan) A portion of this research used resources at the Spallation Neutron Source, a DOE Office of Science User Facility operated by the Oak Ridge National Laboratory. A portion of this research used resources at the Argonne Leadership Computing Facility, a DOE Office of Science User Facility operated by the Argonne National Laboratory. A portion of this research used resources at the Brookhaven National Laboratory, a DOE Office of Science National Laboratory. This research was sponsored by ExaLearn, an Exascale Computing Project, DOE.
Keywords
- Neutron Data Analysis
- Parallel Machine Learning
- Workflow
- heterogeneous computing