DTPPO: Dual-Transformer Encoder-Based Proximal Policy Optimization for Multi-UAV Navigation in Unseen Complex Environments

Existing multi-agent deep reinforcement learning (MADRL) methods for multi-UAV navigation face challenges in generalization, particularly when applied to unseen complex environments. To address these limitations, we propose a Dual-Transformer Encoder-Based Proximal Policy Optimization (<i>DTPP...

Full description

Saved in:
Bibliographic Details
Main Authors: Anning Wei, Jintao Liang, Kaiyuan Lin, Ziyue Li, Rui Zhao
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Drones
Subjects:
Online Access:https://www.mdpi.com/2504-446X/8/12/720
Tags: Add Tag
No Tags, Be the first to tag this record!