Training reinforcement learning models via an adversarial evolutionary algorithm

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

When training for control problems, more episodes used in training usually leads to better generalizability, but more episodes also requires significantly more training time. There are a variety of approaches for selecting the way that training episodes are chosen, including fixed episodes, uniform sampling, and stochastic sampling, but they can all leave gaps in the training landscape. In this work, we describe an approach that leverages an adversarial evolutionary algorithm to identify the worst performing states for a given model. We then use information about these states in the next cycle of training, which is repeated until the desired level of model performance is met. We demonstrate this approach with the OpenAI Gym cart-pole problem. We show that the adversarial evolutionary algorithm did not reduce the number of episodes required in training needed to attain model generalizability when compared with stochastic sampling, and actually performed slightly worse.

Original languageEnglish
Title of host publication51st International Conference on Parallel Processing, ICPP 2022 - Workshop Proceedings
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450394451
DOIs
StatePublished - Aug 29 2022
Event51st International Conference on Parallel Processing, ICPP 2022 - Virtual, Online, France
Duration: Aug 29 2022Sep 1 2022

Publication series

NameACM International Conference Proceeding Series

Conference

Conference51st International Conference on Parallel Processing, ICPP 2022
Country/TerritoryFrance
CityVirtual, Online
Period08/29/2209/1/22

Funding

This material is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, Robinson Pino, program manager, under contract number DE-AC05-00OR22725.

FundersFunder number
U.S. Department of Energy
Office of Science
Advanced Scientific Computing ResearchDE-AC05-00OR22725

    Keywords

    • adversarial evolutionary algorithms
    • reinforcement learning

    Fingerprint

    Dive into the research topics of 'Training reinforcement learning models via an adversarial evolutionary algorithm'. Together they form a unique fingerprint.

    Cite this