Optimization of Jamming Type Selection for Countering Multifunction Radar Based on Generative Adversarial Imitation Learning
In recent years, deep reinforcement learning (DRL) has made some progress in jamming type selection (JTS). However, during the training process of the agent, exploration of the action space is necessary, which leads to poor jamming effects in the early stages of training, posing a significant threat...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2025-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10844286/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832576794729906176 |
---|---|
author | Tianjian Yang You Chen Siyi Cheng Xing Wang Xi Zhang |
author_facet | Tianjian Yang You Chen Siyi Cheng Xing Wang Xi Zhang |
author_sort | Tianjian Yang |
collection | DOAJ |
description | In recent years, deep reinforcement learning (DRL) has made some progress in jamming type selection (JTS). However, during the training process of the agent, exploration of the action space is necessary, which leads to poor jamming effects in the early stages of training, posing a significant threat to the aircraft’s survivability. Therefore, this paper proposes a method of JTS based on generative adversarial imitation learning (GAIL). Firstly, the agent learns from expert strategies to achieve high reward returns, avoiding wasted time from unguided exploration, thereby ensuring that the agent maintains a good jamming effect throughout its application process. Secondly, based on generative adversarial theory, the discriminator measures the difference between the generated and expert strategies. This difference is used as an internal reward to assist in updating the neural network parameters, effectively reducing the complexity of reward function design. Finally, through case analysis, it can be observed that using the GAIL algorithm can achieve rewards close to those of the expert strategy. When applied online, it does not rely on accurate predictions or precise modeling of the external environment, allowing for quick real-time decision-making. Additionally, its performance surpasses that of traditional JTS strategies. |
format | Article |
id | doaj-art-ecbda6bd3183415a8033505ff527b390 |
institution | Kabale University |
issn | 2169-3536 |
language | English |
publishDate | 2025-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj-art-ecbda6bd3183415a8033505ff527b3902025-01-31T00:01:44ZengIEEEIEEE Access2169-35362025-01-0113171101711910.1109/ACCESS.2025.353101610844286Optimization of Jamming Type Selection for Countering Multifunction Radar Based on Generative Adversarial Imitation LearningTianjian Yang0https://orcid.org/0009-0007-8585-4994You Chen1Siyi Cheng2Xing Wang3Xi Zhang4Aviation Engineering School, Air Force Engineering University, Xi’an, ChinaAviation Engineering School, Air Force Engineering University, Xi’an, ChinaAviation Engineering School, Air Force Engineering University, Xi’an, ChinaAviation Engineering School, Air Force Engineering University, Xi’an, ChinaAviation Engineering School, Air Force Engineering University, Xi’an, ChinaIn recent years, deep reinforcement learning (DRL) has made some progress in jamming type selection (JTS). However, during the training process of the agent, exploration of the action space is necessary, which leads to poor jamming effects in the early stages of training, posing a significant threat to the aircraft’s survivability. Therefore, this paper proposes a method of JTS based on generative adversarial imitation learning (GAIL). Firstly, the agent learns from expert strategies to achieve high reward returns, avoiding wasted time from unguided exploration, thereby ensuring that the agent maintains a good jamming effect throughout its application process. Secondly, based on generative adversarial theory, the discriminator measures the difference between the generated and expert strategies. This difference is used as an internal reward to assist in updating the neural network parameters, effectively reducing the complexity of reward function design. Finally, through case analysis, it can be observed that using the GAIL algorithm can achieve rewards close to those of the expert strategy. When applied online, it does not rely on accurate predictions or precise modeling of the external environment, allowing for quick real-time decision-making. Additionally, its performance surpasses that of traditional JTS strategies.https://ieeexplore.ieee.org/document/10844286/Jamming type selectionreinforcement learningimitation learninggenerative adversarial network |
spellingShingle | Tianjian Yang You Chen Siyi Cheng Xing Wang Xi Zhang Optimization of Jamming Type Selection for Countering Multifunction Radar Based on Generative Adversarial Imitation Learning IEEE Access Jamming type selection reinforcement learning imitation learning generative adversarial network |
title | Optimization of Jamming Type Selection for Countering Multifunction Radar Based on Generative Adversarial Imitation Learning |
title_full | Optimization of Jamming Type Selection for Countering Multifunction Radar Based on Generative Adversarial Imitation Learning |
title_fullStr | Optimization of Jamming Type Selection for Countering Multifunction Radar Based on Generative Adversarial Imitation Learning |
title_full_unstemmed | Optimization of Jamming Type Selection for Countering Multifunction Radar Based on Generative Adversarial Imitation Learning |
title_short | Optimization of Jamming Type Selection for Countering Multifunction Radar Based on Generative Adversarial Imitation Learning |
title_sort | optimization of jamming type selection for countering multifunction radar based on generative adversarial imitation learning |
topic | Jamming type selection reinforcement learning imitation learning generative adversarial network |
url | https://ieeexplore.ieee.org/document/10844286/ |
work_keys_str_mv | AT tianjianyang optimizationofjammingtypeselectionforcounteringmultifunctionradarbasedongenerativeadversarialimitationlearning AT youchen optimizationofjammingtypeselectionforcounteringmultifunctionradarbasedongenerativeadversarialimitationlearning AT siyicheng optimizationofjammingtypeselectionforcounteringmultifunctionradarbasedongenerativeadversarialimitationlearning AT xingwang optimizationofjammingtypeselectionforcounteringmultifunctionradarbasedongenerativeadversarialimitationlearning AT xizhang optimizationofjammingtypeselectionforcounteringmultifunctionradarbasedongenerativeadversarialimitationlearning |