DeepHyper: Asynchronous Hyperparameter Search for Deep Neural Networks

Prasanna Balaprakash, Michael Salim, Thomas D. Uram, Venkat Vishwanath, Stefan M. Wild

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

107 Scopus citations

Abstract

Hyperparameters employed by deep learning (DL) methods play a substantial role in the performance and reliability of these methods in practice. Unfortunately, finding performance optimizing hyperparameter settings is a notoriously difficult task. Hyperparameter search methods typically have limited production-strength implementations or do not target scalability within a highly parallel machine, portability across different machines, experimental comparison between different methods, and tighter integration with workflow systems. In this paper, we present DeepHyper, a Python package that provides a common interface for the implementation and study of scalable hyperparameter search methods. It adopts the Balsam workflow system to hide the complexities of running large numbers of hyperparameter configurations in parallel on high-performance computing (HPC) systems. We implement and study asynchronous model-based search methods that consist of sampling a small number of input hyperparameter configurations and progressively fitting surrogate models over the input-output space until exhausting a user-defined budget of evaluations. We evaluate the efficacy of these methods relative to approaches such as random search, genetic algorithms, Bayesian optimization, and hyperband on DL benchmarks on CPU-and GPU-based HPC systems.

Original languageEnglish
Title of host publicationProceedings - 25th IEEE International Conference on High Performance Computing, HiPC 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages42-51
Number of pages10
ISBN (Electronic)9781538683866
DOIs
StatePublished - Jul 2 2018
Externally publishedYes
Event25th IEEE International Conference on High Performance Computing, HiPC 2018 - Bengaluru, India
Duration: Dec 17 2018Dec 20 2018

Publication series

NameProceedings - 25th IEEE International Conference on High Performance Computing, HiPC 2018

Conference

Conference25th IEEE International Conference on High Performance Computing, HiPC 2018
Country/TerritoryIndia
CityBengaluru
Period12/17/1812/20/18

Funding

This material is based upon work supported by the U.S. Department of Energy (DOE), Office of Science, Office of Advanced Scientific Computing Research, under Contract DE-AC02-06CH11357. This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility.

FundersFunder number
U.S. Department of Energy
Office of Science
Advanced Scientific Computing ResearchDE-AC02-06CH11357

    Keywords

    • Bayesian optimization
    • Hyperparameter search
    • deep learning
    • model-based search

    Fingerprint

    Dive into the research topics of 'DeepHyper: Asynchronous Hyperparameter Search for Deep Neural Networks'. Together they form a unique fingerprint.

    Cite this