A UAV Pursuit-Evasion Strategy Based on DDPG and Imitation Learning

The UAV pursuit-evasion strategy based on Deep Deterministic Policy Gradient (DDPG) algorithm is a current research hotspot. However, this algorithm has the defect of low efficiency in sample exploration. To solve this problem, this paper uses the imitation learning (IL) to improve the DDPG explorat...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiaowei Fu, Jindong Zhu, Zhaoying Wei, Hui Wang, Sili Li
Format: Article
Language:English
Published: Wiley 2022-01-01
Series:International Journal of Aerospace Engineering
Online Access:http://dx.doi.org/10.1155/2022/3139610
Tags: Add Tag
No Tags, Be the first to tag this record!