Resource Scheduling Method for Integration of TT&C and Observation Based on Multi-Agent Deep Reinforcement Learning

With the development of satellite communication technology and the continuous expansion of the constellation scale, the integration of TT&C and observation technology has become the mainstream trend.The large constellation scale, many scheduling objects and complex operation joint control bring...

Full description

Saved in:
Bibliographic Details
Main Authors: Siyue CHENG, Haoran LI, Weigang BAI, Di ZHOU, Yan ZHU
Format: Article
Language:zho
Published: Post&Telecom Press Co.,LTD 2023-03-01
Series:天地一体化信息网络
Subjects:
Online Access:http://www.j-sigin.com.cn/zh/article/doi/10.11959/j.issn.2096-8930.2023002/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850091721061302272
author Siyue CHENG
Haoran LI
Weigang BAI
Di ZHOU
Yan ZHU
author_facet Siyue CHENG
Haoran LI
Weigang BAI
Di ZHOU
Yan ZHU
author_sort Siyue CHENG
collection DOAJ
description With the development of satellite communication technology and the continuous expansion of the constellation scale, the integration of TT&C and observation technology has become the mainstream trend.The large constellation scale, many scheduling objects and complex operation joint control bring great challenges to the integrated resource scheduling of satellite network TT&C and observation.Subject to the low solution effi ciency and complex constraints of scheduling algorithms, the traditional TT&C resource scheduling technology adopts the advance injection TT&C instructions to perform tasks according to the fi xed deployment, which is diffi cult to meet the scheduling needs of emergencies and emergency tasks.Therefore, a kind of resource scheduling method based on multi-agent actor-Agent Actor-Critic Deterministic Policy Gradient Algorithms (MADDPG) was presented.With centralized training and distributed execution, the multi-agent model of integrated task of TT&C and observation was established.By analyzed the scheduling strategy of neighbor agent, the response speed of local information was improved.According to the model and constraints in the integrated resource scheduling problem of TT&C and observation, selected signifi cant and interpretable constraints, then established the multi-agent resource scheduling reinforcement learning model, and carried on the simulation test.The simulation results showed that the task benefi t of this method was 22% higher than the traditional method.
format Article
id doaj-art-ca8a2bf4fd884c2db3b7709a4a32055f
institution DOAJ
issn 2096-8930
language zho
publishDate 2023-03-01
publisher Post&Telecom Press Co.,LTD
record_format Article
series 天地一体化信息网络
spelling doaj-art-ca8a2bf4fd884c2db3b7709a4a32055f2025-08-20T02:42:18ZzhoPost&Telecom Press Co.,LTD天地一体化信息网络2096-89302023-03-014122259531878Resource Scheduling Method for Integration of TT&C and Observation Based on Multi-Agent Deep Reinforcement LearningSiyue CHENGHaoran LIWeigang BAIDi ZHOUYan ZHUWith the development of satellite communication technology and the continuous expansion of the constellation scale, the integration of TT&C and observation technology has become the mainstream trend.The large constellation scale, many scheduling objects and complex operation joint control bring great challenges to the integrated resource scheduling of satellite network TT&C and observation.Subject to the low solution effi ciency and complex constraints of scheduling algorithms, the traditional TT&C resource scheduling technology adopts the advance injection TT&C instructions to perform tasks according to the fi xed deployment, which is diffi cult to meet the scheduling needs of emergencies and emergency tasks.Therefore, a kind of resource scheduling method based on multi-agent actor-Agent Actor-Critic Deterministic Policy Gradient Algorithms (MADDPG) was presented.With centralized training and distributed execution, the multi-agent model of integrated task of TT&C and observation was established.By analyzed the scheduling strategy of neighbor agent, the response speed of local information was improved.According to the model and constraints in the integrated resource scheduling problem of TT&C and observation, selected signifi cant and interpretable constraints, then established the multi-agent resource scheduling reinforcement learning model, and carried on the simulation test.The simulation results showed that the task benefi t of this method was 22% higher than the traditional method.http://www.j-sigin.com.cn/zh/article/doi/10.11959/j.issn.2096-8930.2023002/integration of TT&C and observationlarge-scale constellation systemresources schedulingmulti-agent deep reinforcement learningtasks reward
spellingShingle Siyue CHENG
Haoran LI
Weigang BAI
Di ZHOU
Yan ZHU
Resource Scheduling Method for Integration of TT&C and Observation Based on Multi-Agent Deep Reinforcement Learning
天地一体化信息网络
integration of TT&C and observation
large-scale constellation system
resources scheduling
multi-agent deep reinforcement learning
tasks reward
title Resource Scheduling Method for Integration of TT&C and Observation Based on Multi-Agent Deep Reinforcement Learning
title_full Resource Scheduling Method for Integration of TT&C and Observation Based on Multi-Agent Deep Reinforcement Learning
title_fullStr Resource Scheduling Method for Integration of TT&C and Observation Based on Multi-Agent Deep Reinforcement Learning
title_full_unstemmed Resource Scheduling Method for Integration of TT&C and Observation Based on Multi-Agent Deep Reinforcement Learning
title_short Resource Scheduling Method for Integration of TT&C and Observation Based on Multi-Agent Deep Reinforcement Learning
title_sort resource scheduling method for integration of tt c and observation based on multi agent deep reinforcement learning
topic integration of TT&C and observation
large-scale constellation system
resources scheduling
multi-agent deep reinforcement learning
tasks reward
url http://www.j-sigin.com.cn/zh/article/doi/10.11959/j.issn.2096-8930.2023002/
work_keys_str_mv AT siyuecheng resourceschedulingmethodforintegrationofttcandobservationbasedonmultiagentdeepreinforcementlearning
AT haoranli resourceschedulingmethodforintegrationofttcandobservationbasedonmultiagentdeepreinforcementlearning
AT weigangbai resourceschedulingmethodforintegrationofttcandobservationbasedonmultiagentdeepreinforcementlearning
AT dizhou resourceschedulingmethodforintegrationofttcandobservationbasedonmultiagentdeepreinforcementlearning
AT yanzhu resourceschedulingmethodforintegrationofttcandobservationbasedonmultiagentdeepreinforcementlearning