Energy and power aware job scheduling and resource management: Global survey - Initial analysis

Matthias Maiterth, Gregory Koenig, Kevin Pedretti, Siddhartha Jana, Natalie Bates, Andrea Borghesi, Dave Montoya, Andrea Bartolini, Milos Puzovic

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

24 Scopus citations

Abstract

This work describes the motivation and methodology of a first-of-its-kind global survey of HPC centers actively employing Energy and Power Aware Scheduling and Resource Management solutions for their production systems. The Energy-Efficient High-Performance-Computing Working-Group (EE HPC WG) Energy and Power Aware Job Scheduling and Resource Management (EPA JSRM) team conducted comprehensive interviews over the course of 2016 and 2017. In this work, we present the selection of participating sites, the motivation behind the survey, a detailed description of the questionnaire, and illustrate why getting a global view of the ongoing efforts is a major step towards more efficient systems. Job Scheduling and Resource Management is being tackled using new approaches regarding Power and Energy and has important implications for achievable center strategies. With this survey, we are laying foundations necessary to give insights in how problems and respective solutions are approached across sites and centers to allow to identify differences, similarities, solutions, and possible technology transfer across sites and centers. Upcoming work will focus on the survey responses and the analysis thereof. At the point of writing, the EPA JSRM team is in the major analysis phase of the centers' responses. By splitting the work in this fashion we achieve increased clarity in presentation and have the opportunity to generate more detailed analysis in benevolence of the community and reader.

Original languageEnglish
Title of host publicationProceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages685-693
Number of pages9
ISBN (Print)9781538655559
DOIs
StatePublished - Aug 3 2018
Externally publishedYes
Event32nd IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018 - Vancouver, Canada
Duration: May 21 2018May 25 2018

Publication series

NameProceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018

Conference

Conference32nd IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018
Country/TerritoryCanada
CityVancouver
Period05/21/1805/25/18

Keywords

  • Computing
  • Energy
  • Performance
  • Power
  • Power-aware
  • Scheduling

Fingerprint

Dive into the research topics of 'Energy and power aware job scheduling and resource management: Global survey - Initial analysis'. Together they form a unique fingerprint.

Cite this