Abstract
Residential heating, ventilation, and air conditioning (HVAC) has been considered as an important demand response resource. However, the optimization of residential HVAC control is no trivial task due to the complexity of the thermal dynamic models of buildings and uncertainty associated with both occupant-driven heat loads and weather forecasts. In this paper, we apply a novel model-free deep reinforcement learning (RL) method, known as the deep deterministic policy gradient (DDPG), to generate an optimal control strategy for a multi-zone residential HVAC system with the goal of minimizing energy consumption cost while maintaining the users’ comfort. The applied deep RL-based method learns through continuous interaction with a simulated building environment and without referring to any prior model knowledge. Simulation results show that compared with the state-of-art deep Q network (DQN), the DDPG-based HVAC control strategy can reduce the energy consumption cost by 15% and reduce the comfort violation by 79%; and when compared with a rule-based HVAC control strategy, the comfort violation can be reduced by 98%. In addition, experiments with different building models and retail price models demonstrate that the well-trained DDPG-based HVAC control strategy has high generalization and adaptability to unseen environments, which indicates its practicability for real-world implementation.
Original language | English |
---|---|
Article number | 116117 |
Journal | Applied Energy |
Volume | 281 |
DOIs | |
State | Published - Jan 1 2021 |
Funding
The authors would like to acknowledge the support in part by the U.S. Department of Energy (DOE), including Office of Energy Efficiency and Renewable Energy under the Buildings Technologies Program, in part by CURENT which is an Engineering Research Center (ERC) funded by the U.S. National Science Foundation (NSF) and DOE under the NSF award EEC-1041877, and in part by the U.S. NSF award ECCS-1809458.
Keywords
- Actor-critic learning
- Deep deterministic policy gradient (DDPG)
- Deep reinforcement learning (deep RL)
- Demand response
- Multi-zone residential HVAC