Energy Storage Science and Technology ›› 2025, Vol. 14 ›› Issue (8): 3138-3148.doi: 10.19799/j.cnki.2095-4239.2025.0296

• Energy Storage System and Engineering • Previous Articles    

Cooperative primary frequency modulation control method for distributed energy storage based on reinforcement learning-model predictive control

Qian MA1(), Liang XIAO1, Bing CHENG2, Qin GAO1, Chunxiao LIU1, Yihua ZHU3, Chengxiang LI3   

  1. 1.China Southern Power Grid Power Dispatching and Control Center, Guangzhou 510530, Guangdong, China
    2.Hainan Power Grid Company Limited, Haikou 570203, Hainan, China
    3.State Key Laboratory of HVDC (Electric Power Research Institute of China Southern Power Grid Company Limited), Guangzhou 510663, Guangdong, China
  • Received:2025-03-27 Revised:2025-04-30 Online:2025-08-28 Published:2025-08-18
  • Contact: Qian MA E-mail:maqian@csg.cn

Abstract:

To enhance the frequency characteristics of power grids and fully leverage the rapid response advantages of distributed energy storage systems (DESSs), a cooperative primary frequency control method based on reinforcement learning-model predictive control (RL-MPC) is proposed. First, a primary frequency control model incorporating DESSs is established based on frequency response characteristics, state of charge (SOC), and power control strategies. Then, a hierarchical mixed control architecture is designed: the upper layer employs a deep Q-network (DQN) to dynamically optimize the MPC weight matrix while sensing frequency deviation, rate of change, and SOC distribution entropy in real time. The lower layer utilizes distributed MPC to determine the output sequences of multi-node energy storage units and introduces a graph attention network (GAT) to achieve adaptive optimization of the communication topology. This approach reduces computational complexity in coordinated control and enhances the strategy's generalization capability. Finally, simulations conducted in Matlab/Simulink verify that the proposed method effectively improves the primary frequency response speed and control accuracy of DESSs, thereby strengthening overall power system stability.

Key words: distributed energy storage, frequency modulation, model predictive control, reinforcement learning, graph attention network

CLC Number: