Abstract
Monitoring and testing for regression of large-scale systems such as the NCSA's Blue Waters supercomputer are challenging tasks. In this paper, we describe the solution we came up with to perform those tasks. Our goal was to find an automated solution for running user-level regression tests to evaluate system usability and performance. Jenkins, an automation server software, was chosen for its versatility, large user base, and multitude of plugins including collecting data and plotting test results over time. We describe our Jenkins deployment to launch and monitor jobs on remote HPC system, perform authentication with one-time password, and integrate with our LDAP server for its authorization. We show some use cases and describe our best practices for successfully using Jenkins as a user-level system-wide regression testing and monitoring framework for large supercomputer systems.
Original language | English |
---|---|
Article number | e4339 |
Journal | Concurrency and Computation: Practice and Experience |
Volume | 30 |
Issue number | 1 |
DOIs | |
State | Published - Jan 10 2018 |
Funding
This material is based upon work performed using computational resources at the National Institute for Computational Sciences (NICS). NICS is supported by the University of Tennessee and Oak Ridge National Laboratory through the Joint Institute for Computational Sciences.¶ This material is part of the Blue Waters sustained-petascale computing project, which is supported by the National Science Foundation (awards OCI-0725070 and ACI-1238993) and the state of Illinois. Blue Waters is a joint effort of the University of Illinois at Urbana-Champaign and its National Center for Supercomputing Applications. A portion of this work used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the US Department of Energy under Contract No. DE-AC05-00OR22725.
Keywords
- applications
- benchmarking
- performance
- regression testing
- system-monitoring