Adaptive dynamic programming optimal tracking control of multi-UAVs based on zero-sum game

This paper investigates an adaptive dynamic programming (ADP) optimal tracking control algorithm for multi-UAV systems based on zero-sum game theory, addressing external disturbances during flight. By formulating the bounded [Formula: see text] gain problem as a two-person zero-sum game between cont...

Full description

Saved in:
Bibliographic Details
Main Authors: Shuaibin Guan, Xingjian Fu
Format: Article
Language:English
Published: Taylor & Francis Group 2025-07-01
Series:Automatika
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/00051144.2025.2497616
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper investigates an adaptive dynamic programming (ADP) optimal tracking control algorithm for multi-UAV systems based on zero-sum game theory, addressing external disturbances during flight. By formulating the bounded [Formula: see text] gain problem as a two-person zero-sum game between control strategies and external disturbances, the Hamilton-Jacobi-Isaacs (HJI) equation is constructed to derive the Nash equilibrium solution. To overcome the computational challenges of solving the HJI equation, a three-layer neural network structure comprising evaluation, execution, and disturbance networks is employed, integrated with the ADP algorithm to approximate the value function and optimize the control strategy iteratively. Comprehensive simulations demonstrate the proposed method's superior trajectory tracking performance and robustness compared to Sliding Mode Control (SMC). The results confirm the effectiveness of the ADP-based approach in achieving real-time, adaptive control in complex and dynamic multi-UAV environments.
ISSN:0005-1144
1848-3380