Reinforcement learning control for data center refrigeration systems

Computer Engineering & Science ›› 2025, Vol. 47 ›› Issue (3): 422-433.

• High Performance Computing • Previous Articles Next Articles

Reinforcement learning control for data center refrigeration systems

WEI Dong 1，2，JIA Yuchen1，HAN Shaoran3

(1.School of Electrical and Information Engineering,Beijing University of Civil Engineering and Architecture,Beijing 100044;
2.Key Laboratory of Intelligent Processing for Building Big Data,
Beijing University of Civil Engineering and Architecture,Beijing 100044;
3.Beijing Jingcheng Ruida Electric Engineering Technology Co.,Ltd.,Beijing 100176,China)

Received:2023-07-13 Revised:2024-03-20 Online:2025-03-25 Published:2025-04-01

Abstract

Abstract: The refrigeration system in data centers needs to operate continuously throughout the year, and its energy consumption cannot be ignored. Moreover, traditional PID control methods struggle to achieve overall energy savings for the system. To address this, a reinforcement learning control strategy is proposed for data center refrigeration systems, with the control objective of enhancing the overall energy efficiency of the system while meeting cooling requirements. A two-layer hierarchical control structure is designed. The upper optimization layer introduces the multistep prediction-deep deterministic policy gradient (MP-DDPG) algorithm, which leverages DDPG to handle the multi-dimensional continuous action space of the refrigeration system to determine the water valve opening of the air hand- ling unit and the optimal setpoint for each loop in the chilling station system. Multistep prediction is employed to enhance algorithm efficiency and overcome the impact of large system delay during real-time control. The lower field control layer uses PID control to enable the controlled variables to track the optimal setpoints obtained from the optimization layer, achieving performance optimization without disrupting the existing field control system. To address the challenge of real-time control with model-free reinforcement learning, a system prediction model is first constructed, and the reinforcement learning controller is trained offline through interaction with this model. Subsequently, online real-time control is implemented. Experimental results show that compared to the traditional DDPG algorithm, the learning efficiency of the controller is improved by 50%. Compared to PID and MP-DQN (multistep prediction-deep Q network), the systems dynamic performance is improved, and the whole energy efficiency is increased by approximately 30.149% and 11.6%, respectively.

Key words: data center refrigeration system, predictive control, reinforcement learning, depth deterministic strategy gradient method, integrated learning

WEI Dong , JIA Yuchen, HAN Shaoran. Reinforcement learning control for data center refrigeration systems[J]. Computer Engineering & Science, 2025, 47(3): 422-433.

[1]	CHEN Junyan1, LI Xinmei1, ZHU Changhong2, XIAO Wei3. A routing optimization algorithm for software-defined optical transport network based on multi-view graph attention mechanism [J]. Computer Engineering & Science, 2025, 47(7): 1193-1204.
[2]	LI Tianyun, LI Tao, WEN Dong, YANG Hui, ZHANG Yutao, LUO Xin, DONG Dezun. A survey on artificial intelligence based congestion control [J]. Computer Engineering & Science, 2025, 47(6): 1018-1027.
[3]	DI Jian, WAN Xue, JIANG Limei, . An evolutionary reinforcement learning algorithm based on stochastic symmetric search [J]. Computer Engineering & Science, 2025, 47(5): 912-920.
[4]	ZHANG Zheng, XIA Xiaoyun, CHEN Zefeng, XIANG Yi. A staged strategy incorporating reinforcement learning to solve the travelling thief problem [J]. Computer Engineering & Science, 2025, 47(1): 140-149.
[5]	YU Shirui, JIANG Chunmao. A cloud computing virtual machine scheduling strategy based on fuzzy reinforcement learning [J]. Computer Engineering & Science, 2025, 47(1): 56-65.
[6]	ZHUANG Shu-xin, CHEN Yong-hong, HAO Yi-hang, WU Wei-wei, XU Xue-yong, WANG Wan-yuan. A population diversity-based robust policy generation method in adversarial game environments#br# [J]. Computer Engineering & Science, 2024, 46(6): 1081-1091.
[7]	DUAN Cheng-long, YUAN Jie, CHANG Qian-kun, ZHANG Ning-ning. Inverse reinforcement learning algorithm based on D2GA [J]. Computer Engineering & Science, 2024, 46(11): 2053-2062.
[8]	GU Ying-cheng, WEI Liu, JIANG Ning, CHENG Huan-yu, LIU Kai, SONG Yu, LIU Mei-zhao, TANG Lei, CHEN Yu, ZHANG Sheng. Edge server assignment for distributed interactive applications in edge environments [J]. Computer Engineering & Science, 2024, 46(10): 1748-1756.
[9]	CAI Yu, GUAN Zheng, WANG Zeng-wen, WANG Xue, YANG Zhi-jun. Resource allocation algorithm for distinguished services in vehicular networks based on multi-agent deep reinforcement learning [J]. Computer Engineering & Science, 2024, 46(10): 1757-1764.
[10]	ZENG Fan-feng, WANG Chun-zhen, LI Chen. An unsupervised video summarization algorithm based on deep and shallow feature fusion [J]. Computer Engineering & Science, 2023, 45(9): 1602-1610.
[11]	WANG Yang, CHEN Zhi-bin. A dynamic graph transformer model for solving CVRP [J]. Computer Engineering & Science, 2023, 45(5): 859-868.
[12]	PENG Kun-yan, YIN Xiang, LIU Xiao-zhu, LI Heng-yu. A strategy search method based on particle swarm optimization and deep reinforcement learning [J]. Computer Engineering & Science, 2023, 45(4): 718-725.
[13]	GUAN Rui, DING Jia-man, JIA Lian-yin, YOU Jin-guo, JIANG Ying, . A diversity document ranking algorithm based on reinforcement learning [J]. Computer Engineering & Science, 2020, 42(9): 1697-1703.
[14]	CAI Yue, YOU Jin-guo, DING Jia-man. Proximal policy optimization and adversarial learning based dialog generation [J]. Computer Engineering & Science, 2020, 42(9): 1680-1689.
[15]	HAN Hu, SUN Tian-yue, ZHAO Qi-tao. Generative adversarial networks with autoencoder for text generation [J]. Computer Engineering & Science, 2020, 42(9): 1704-1710.

Reinforcement learning control for data center refrigeration systems

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 15

Recommended Articles

Metrics

Comments