Intelligent multi-zone residential HVAC control strategy based on deep reinforcement learning

Yan Du, Helia Zandi, Olivera Kotevska, Kuldeep Kurte, Jeffery Munk, Kadir Amasyali, Evan Mckee, Fangxing Li

Research output: Contribution to journalArticlepeer-review

189 Scopus citations

Abstract

Residential heating, ventilation, and air conditioning (HVAC) has been considered as an important demand response resource. However, the optimization of residential HVAC control is no trivial task due to the complexity of the thermal dynamic models of buildings and uncertainty associated with both occupant-driven heat loads and weather forecasts. In this paper, we apply a novel model-free deep reinforcement learning (RL) method, known as the deep deterministic policy gradient (DDPG), to generate an optimal control strategy for a multi-zone residential HVAC system with the goal of minimizing energy consumption cost while maintaining the users’ comfort. The applied deep RL-based method learns through continuous interaction with a simulated building environment and without referring to any prior model knowledge. Simulation results show that compared with the state-of-art deep Q network (DQN), the DDPG-based HVAC control strategy can reduce the energy consumption cost by 15% and reduce the comfort violation by 79%; and when compared with a rule-based HVAC control strategy, the comfort violation can be reduced by 98%. In addition, experiments with different building models and retail price models demonstrate that the well-trained DDPG-based HVAC control strategy has high generalization and adaptability to unseen environments, which indicates its practicability for real-world implementation.

Original languageEnglish
Article number116117
JournalApplied Energy
Volume281
DOIs
StatePublished - Jan 1 2021

Funding

The authors would like to acknowledge the support in part by the U.S. Department of Energy (DOE), including Office of Energy Efficiency and Renewable Energy under the Buildings Technologies Program, in part by CURENT which is an Engineering Research Center (ERC) funded by the U.S. National Science Foundation (NSF) and DOE under the NSF award EEC-1041877, and in part by the U.S. NSF award ECCS-1809458.

Keywords

  • Actor-critic learning
  • Deep deterministic policy gradient (DDPG)
  • Deep reinforcement learning (deep RL)
  • Demand response
  • Multi-zone residential HVAC

Fingerprint

Dive into the research topics of 'Intelligent multi-zone residential HVAC control strategy based on deep reinforcement learning'. Together they form a unique fingerprint.

Cite this