Transformer Neural Networks with Spatiotemporal Attention for Predictive Control and Optimization of Industrial Processes

  • Ethan R. Gallup
  • , Jacob F. Tuttle
  • , Jake Immonen
  • , Blake Billings
  • , Kody M. Powell

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

In the context of real-time optimization and model predictive control of industrial systems, machine learning, and neural networks represent cutting-edge tools that hold promise for enhancing dynamic modeling. This work presents a transformer neural network architecture for real-time optimization and model predictive control. This network design includes a modified attention mechanism inspired by positional embedding attention from vision transformers and task-specific modifications to the input-output structure of the transformer's decoder stack. Experiments were conducted using data from a 450 MW coal-fired power plant to evaluate this approach's effectiveness. The transformer neural network was compared with conventional recurrent models, including GRU and LSTM neural networks. The transformer exhibited a 6% increase in the R-squared value of predictions and an 83% reduction in mean squared error. Computation time was also reduced by 84% compared to conventional recurrent models.

Original languageEnglish
Title of host publication2024 American Control Conference, ACC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages382-387
Number of pages6
ISBN (Electronic)9798350382655
DOIs
StatePublished - 2024
Externally publishedYes
Event2024 American Control Conference, ACC 2024 - Toronto, Canada
Duration: Jul 10 2024Jul 12 2024

Publication series

NameProceedings of the American Control Conference
ISSN (Print)0743-1619

Conference

Conference2024 American Control Conference, ACC 2024
Country/TerritoryCanada
CityToronto
Period07/10/2407/12/24

Funding

Research supported by the Department of Energy. This research is funded by the United States Department of Energy project DE-FE0031754. The funding sources did not contribute to the study design or the writing of this report.

Fingerprint

Dive into the research topics of 'Transformer Neural Networks with Spatiotemporal Attention for Predictive Control and Optimization of Industrial Processes'. Together they form a unique fingerprint.

Cite this