Coordinated Optimization of Active and Reactive Power of Active Distribution Network Based on Safety Reinforcement Learning
A safe reinforcement learning method based on offline strategies is proposed. Through offline training of a large amount of historical operating data of the distribution network, it gets rid of the traditional optimization method. Dependence on complete and accurate models. First, combined with the...
Saved in:
| Main Authors: | , , , , , , |
|---|---|
| Format: | Article |
| Language: | zho |
| Published: |
State Grid Energy Research Institute
2024-03-01
|
| Series: | Zhongguo dianli |
| Subjects: | |
| Online Access: | https://www.electricpower.com.cn/CN/10.11930/j.issn.1004-9649.202311065 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | A safe reinforcement learning method based on offline strategies is proposed. Through offline training of a large amount of historical operating data of the distribution network, it gets rid of the traditional optimization method. Dependence on complete and accurate models. First, combined with the distribution network parameter information, an active and reactive power optimization model based on the constrained Markov decision process (CMDP) was established; then, a new safety reinforcement learning method was designed based on the original dual optimization method. The cost function is minimized while maximizing future discount rewards; finally, simulations are performed on power distribution system. The simulation results show that the proposed method can online generate a dispatching strategy that satisfies complex constraints and has economic benefits based on real-time observation information of the distribution network. |
|---|---|
| ISSN: | 1004-9649 |