Continual Learning via Dynamic Programming

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Continual learning (CL) algorithms seek to train a model when faced with similar tasks observed in a sequential manner. Despite promising methodological advancements, however, there is a lack of theoretical frameworks that enable analysis of learning challenges such as generalization and catastrophic forgetting, especially in applications where the tasks are generated continuously through a partial differential equation. To address this lack, we present a new theoretical framework that models the learning dynamics in CL through dynamic programming. Using the proposed framework, we derive a new method that adopts stochastic-gradient-driven alternating optimization to balance generalization and catastrophic forgetting. We establish conditions for convergence and show that, on CL benchmark datasets, our method achieves accuracies better than or comparable to those of existing state-of-the-art methods.

Original languageEnglish
Title of host publication2022 26th International Conference on Pattern Recognition, ICPR 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1350-1356
Number of pages7
ISBN (Electronic)9781665490627
DOIs
StatePublished - 2022
Externally publishedYes
Event26th International Conference on Pattern Recognition, ICPR 2022 - Montreal, Canada
Duration: Aug 21 2022Aug 25 2022

Publication series

NameProceedings - International Conference on Pattern Recognition
Volume2022-August
ISSN (Print)1051-4651

Conference

Conference26th International Conference on Pattern Recognition, ICPR 2022
Country/TerritoryCanada
CityMontreal
Period08/21/2208/25/22

Funding

ACKNOWLEDGMENT This work was supported by the U.S. Department of Energy, Office of Science, Advanced Scientific Computing Research, under Contract DE-AC02-06CH11357 and a DOE Early Career Research Program award. We are grateful for the computing resources from the Joint Laboratory for System Evaluation and Leadership Computing Facility at Argonne.

FundersFunder number
U.S. Department of Energy
Office of Science
Advanced Scientific Computing ResearchDE-AC02-06CH11357

    Keywords

    • Bellman principle
    • continual learning
    • dynamic programming

    Fingerprint

    Dive into the research topics of 'Continual Learning via Dynamic Programming'. Together they form a unique fingerprint.

    Cite this