Path Planning of Unmanned Helicopter in Complex Environment Based on Heuristic Deep Q-Network

Unmanned helicopters (UH) can evade radar detection by flying at ultralow altitudes, so as to conduct raids on targets. Path planning is one of the key technologies to realize UH’s autonomous completion of raid missions. Since the probability of UH being detected by radar varies with height, how to...

Full description

Saved in:

Bibliographic Details
Main Authors:	Jiangyi Yao, Xiongwei Li, Yang Zhang, Jingyu Ji, Yanchao Wang, Yicen Liu
Format:	Article
Language:	English
Published:	Wiley 2022-01-01
Series:	International Journal of Aerospace Engineering
Online Access:	http://dx.doi.org/10.1155/2022/1360956
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832565716954382336
author	Jiangyi Yao Xiongwei Li Yang Zhang Jingyu Ji Yanchao Wang Yicen Liu
author_facet	Jiangyi Yao Xiongwei Li Yang Zhang Jingyu Ji Yanchao Wang Yicen Liu
author_sort	Jiangyi Yao
collection	DOAJ
description	Unmanned helicopters (UH) can evade radar detection by flying at ultralow altitudes, so as to conduct raids on targets. Path planning is one of the key technologies to realize UH’s autonomous completion of raid missions. Since the probability of UH being detected by radar varies with height, how to accurately identify the radar coverage area to avoid crossing has become a difficult problem in UH path planning. Aiming at this problem, a heuristic deep Q-network (H-DQN) algorithm is proposed. First, as part of the comprehensive reward function, a heuristic reward function is designed. The function can generate dynamic rewards in real time according to the environmental information, so as to guide the UH to move closer to the target and at the same time promote the convergence of the algorithm. Second, in order to smooth the flight path, a smoothing reward function is proposed. This function can evaluate the pros and cons of UH’s actions, so as to prompt UH to choose a smoother path for flight. Finally, the heuristic reward function, the smooth reward function, the collision penalty, and the completion reward are weighted and summed to obtain the heuristic comprehensive reward function. Simulation experiments show that the H-DQN algorithm can help UH to effectively avoid the radar coverage area and successfully complete the raid mission.
format	Article
id	doaj-art-627bb9a6ccce44988c84235267424f84
institution	Kabale University
issn	1687-5974
language	English
publishDate	2022-01-01
publisher	Wiley
record_format	Article
series	International Journal of Aerospace Engineering
spelling	doaj-art-627bb9a6ccce44988c84235267424f842025-02-03T01:06:46ZengWileyInternational Journal of Aerospace Engineering1687-59742022-01-01202210.1155/2022/1360956Path Planning of Unmanned Helicopter in Complex Environment Based on Heuristic Deep Q-NetworkJiangyi Yao0Xiongwei Li1Yang Zhang2Jingyu Ji3Yanchao Wang4Yicen Liu5Equipment Simulation Training CenterEquipment Simulation Training CenterEquipment Simulation Training CenterDepartment of UAV EngineeringEquipment Simulation Training CenterState Key Laboratory of Blind Signal ProcessingUnmanned helicopters (UH) can evade radar detection by flying at ultralow altitudes, so as to conduct raids on targets. Path planning is one of the key technologies to realize UH’s autonomous completion of raid missions. Since the probability of UH being detected by radar varies with height, how to accurately identify the radar coverage area to avoid crossing has become a difficult problem in UH path planning. Aiming at this problem, a heuristic deep Q-network (H-DQN) algorithm is proposed. First, as part of the comprehensive reward function, a heuristic reward function is designed. The function can generate dynamic rewards in real time according to the environmental information, so as to guide the UH to move closer to the target and at the same time promote the convergence of the algorithm. Second, in order to smooth the flight path, a smoothing reward function is proposed. This function can evaluate the pros and cons of UH’s actions, so as to prompt UH to choose a smoother path for flight. Finally, the heuristic reward function, the smooth reward function, the collision penalty, and the completion reward are weighted and summed to obtain the heuristic comprehensive reward function. Simulation experiments show that the H-DQN algorithm can help UH to effectively avoid the radar coverage area and successfully complete the raid mission.http://dx.doi.org/10.1155/2022/1360956
spellingShingle	Jiangyi Yao Xiongwei Li Yang Zhang Jingyu Ji Yanchao Wang Yicen Liu Path Planning of Unmanned Helicopter in Complex Environment Based on Heuristic Deep Q-Network International Journal of Aerospace Engineering
title	Path Planning of Unmanned Helicopter in Complex Environment Based on Heuristic Deep Q-Network
title_full	Path Planning of Unmanned Helicopter in Complex Environment Based on Heuristic Deep Q-Network
title_fullStr	Path Planning of Unmanned Helicopter in Complex Environment Based on Heuristic Deep Q-Network
title_full_unstemmed	Path Planning of Unmanned Helicopter in Complex Environment Based on Heuristic Deep Q-Network
title_short	Path Planning of Unmanned Helicopter in Complex Environment Based on Heuristic Deep Q-Network
title_sort	path planning of unmanned helicopter in complex environment based on heuristic deep q network
url	http://dx.doi.org/10.1155/2022/1360956
work_keys_str_mv	AT jiangyiyao pathplanningofunmannedhelicopterincomplexenvironmentbasedonheuristicdeepqnetwork AT xiongweili pathplanningofunmannedhelicopterincomplexenvironmentbasedonheuristicdeepqnetwork AT yangzhang pathplanningofunmannedhelicopterincomplexenvironmentbasedonheuristicdeepqnetwork AT jingyuji pathplanningofunmannedhelicopterincomplexenvironmentbasedonheuristicdeepqnetwork AT yanchaowang pathplanningofunmannedhelicopterincomplexenvironmentbasedonheuristicdeepqnetwork AT yicenliu pathplanningofunmannedhelicopterincomplexenvironmentbasedonheuristicdeepqnetwork

Path Planning of Unmanned Helicopter in Complex Environment Based on Heuristic Deep Q-Network

Similar Items