Evolving deep networks using HPC

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

41 Scopus citations

Abstract

While a large number of deep learning networks have been studied and published that produce outstanding results on natural image datasets, these datasets only make up a fraction of those to which deep learning can be applied. These datasets include text data, audio data, and arrays of sensors that have very different characteristics than natural images. As these “best” networks for natural images have been largely discovered through experimentation and cannot be proven optimal on some theoretical basis, there is no reason to believe that they are the optimal network for these drastically different datasets. Hyperparameter search is thus often a very important process when applying deep learning to a new problem. In this work we present an evolutionary approach to searching the possible space of network hyperparameters and construction that can scale to 18, 000 nodes. This approach is applied to datasets of varying types and characteristics where we demonstrate the ability to rapidly find best hyperparameters in order to enable practitioners to quickly iterate between idea and result.

Original languageEnglish
Title of host publicationProceedings of MLHPC 2017
Subtitle of host publicationMachine Learning in HPC Environments - Held in conjunction with SC 2017: The International Conference for High Performance Computing, Networking, Storage and Analysis
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9781450351379
DOIs
StatePublished - Nov 12 2017
Event2017 Machine Learning in HPC Environments, MLHPC 2017 - Denver, United States
Duration: Nov 12 2017Nov 17 2017

Publication series

NameProceedings of MLHPC 2017: Machine Learning in HPC Environments - Held in conjunction with SC 2017: The International Conference for High Performance Computing, Networking, Storage and Analysis

Conference

Conference2017 Machine Learning in HPC Environments, MLHPC 2017
Country/TerritoryUnited States
CityDenver
Period11/12/1711/17/17

Funding

Support for participating scientists was provided by NSF and DOE (USA) by CAPES and CNPq (Brazil), by CoNaCyT (Mexico), by Proyecto Basal FB 0821, CONICYT PIA ACT1413, Fondecyt 3170845 and 11130133 (Chile), by CONCYTEC, DGI-PUCP and IDI/IGI-UNI (Peru), by Latin American Center for Physics (CLAF) and by RAS and the Russian Ministry of Education and Science (Russia). MINERvA is supported using the resources of the Fermi National Accelerator Laboratory (Fermilab), a U.S. Department of Energy, Office of Science, HEP User Facility. Fermilab is managed by Fermi Research Alliance, LLC (FRA), acting under Contract No. DE-AC02-07CH11359, which included the MINERvA construction project. Construction support also was granted by the United States National Science Foundation under Award PHY-0619727 and by the University of Rochester. Notice: This manuscript has been authored by UT-Battelle, LLC under contract DE-AC05-00OR22725, and Fermi Research Alliance, LLC (FRA) under contract DE-AC02-07CH11359 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http: //energy.gov/downloads/doe-public-access-plan). Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only. MLHPC’17, November 12–17, 2017, Denver, CO, USA © 2017 Copyright held by the owner/author(s). Publication rights licensed to Association for Computing Machinery. ACM ISBN 978-1-4503-5137-9/17/11...$15.00 https://doi.org/10.1145/3146347.3146355 Research sponsored by the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory, managed by UT-Battelle, LLC, for the U. S. Department of Energy. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. We would like to thank the MINERvA collaboration for the use of their simulated data and for many useful and stimulating conversations. MINERvA is supported using the resources of the Fermi National Accelerator Laboratory (Fermilab), a U.S. Department of Energy, Office of Science, HEP User Facility. Fermilab is managed by Fermi Research Alliance, LLC (FRA), acting under Contract No. DE-AC02-07CH11359, which included the MINERvA construction project. Construction support also was granted by the United States National Science Foundation under Award PHY-0619727 and by the University of Rochester. Support for participating scientists was provided by NSF and DOE (USA) by CAPES and CNPq (Brazil), by CoNaCyT (Mexico), by Proyecto Basal FB 0821, CONICYT PIA ACT1413, Fondecyt 3170845 and 11130133 (Chile), by CONCYTEC, DGI-PUCP and IDI/IGI-UNI (Peru), by Latin American Center for Physics (CLAF) and by RAS and the Russian Ministry of Education and Science (Russia). This work benefited from the use of the SasView application, originally developed under NSF award DMR-0520547. SasView contains code developed with funding from the European Union’s Horizon 2020 research and innovation programme under the SINE2020 project, grant agreement No. 654000. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. Research sponsored by the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory, managed by UT-Battelle, LLC, for the U. S. Department of Energy. This work benefited from the use of the SasView application, originally developed under NSF award DMR-0520547. SasView contains code developed with funding from the European Union’s Horizon 2020 research and innovation programme under the SINE2020 project, grant agreement No. 654000.

FundersFunder number
CLAF
DGI-PUCP
Fermi Research Alliance, LLC
IDI
IGI-UNI
Latin American Center for Physics
Proyecto BasalFB 0821
National Science FoundationDMR-0520547, PHY-0619727
U.S. Department of EnergyDE-AC05-00OR22725, DE-AC02-07CH11359
FacebookACT1413
Office of Science
Oak Ridge National Laboratory
Fermilab
University of Rochester
Horizon 2020 Framework Programme654000
Rochester Academy of Science
United States-Israel Binational Science Foundation
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior
Agencia Nacional de Investigación y DesarrolloPIA ACT1413
Fondo Nacional de Desarrollo Científico y Tecnológico3170845, 11130133
Consejo Nacional de Ciencia y Tecnología
Ministry of Education and Science of the Russian Federation
Conselho Nacional de Desenvolvimento Científico e Tecnológico
National Science Foundation
Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica

    Keywords

    • Deep learning
    • Evolutionary algorithms
    • High performance computing
    • Hyperparameter optimization

    Fingerprint

    Dive into the research topics of 'Evolving deep networks using HPC'. Together they form a unique fingerprint.

    Cite this