Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID

n adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulatio...

Full description

Saved in:

Bibliographic Details
Main Authors:	Wei Guan, Zhaoyong Xi, Zhewen Cui, Xianku Zhang
Format:	Article
Language:	English
Published:	Faculty of Mechanical Engineering and Naval Architecture 2025-01-01
Series:	Brodogradnja
Subjects:	deep reinforcement learning unmanned surface vehicle soft actor critic pid tuning
Online Access:	https://hrcak.srce.hr/file/476944
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850117840703586304
author	Wei Guan Zhaoyong Xi Zhewen Cui Xianku Zhang
author_facet	Wei Guan Zhaoyong Xi Zhewen Cui Xianku Zhang
author_sort	Wei Guan
collection	DOAJ
description	n adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulation. Furthermore, once tuned, these gains remain fixed and making further modifications becomes time-consuming and labor-intensive. To address these limitations, the SAC algorithm is introduced, enabling online tuning of PID gains through agent-environment interaction. Additionally, the strategy of combining SAC algorithm with PID controller mitigates concerns regarding interpretability and security often associated with DRL. In this study, stability analysis of the adaptive trajectory controller based on the SAC-PID algorithm is conducted. This paper horizontally compares the proposed method with traditional PID tuning methods, genetic algorithms (GA), and deep deterministic policy gradient (DDPG) algorithm to highlight the superiority of the SAC-PID approach. Finally, experiments in different scenarios are performed to compare generalization capabilities between DDPG and SAC algorithms. Results demonstrate that the proposed SAC-PID algorithm exhibits excellent stability properties, fast convergence speed, and strong generalization ability.
format	Article
id	doaj-art-8ec3c4f0496546898e07784fae0d486c
institution	OA Journals
issn	0007-215X 1845-5859
language	English
publishDate	2025-01-01
publisher	Faculty of Mechanical Engineering and Naval Architecture
record_format	Article
series	Brodogradnja
spelling	doaj-art-8ec3c4f0496546898e07784fae0d486c2025-08-20T02:36:00ZengFaculty of Mechanical Engineering and Naval ArchitectureBrodogradnja0007-215X1845-58592025-01-0176212210.21278/brod76206Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PIDWei Guan0Zhaoyong Xi1Zhewen Cui2Xianku Zhang3Navigation College, Dalian Maritime University, Dalian, ChinaNavigation College, Dalian Maritime University, Dalian, ChinaNavigation College, Dalian Maritime University, Dalian, ChinaNavigation College, Dalian Maritime University, Dalian, Chinan adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulation. Furthermore, once tuned, these gains remain fixed and making further modifications becomes time-consuming and labor-intensive. To address these limitations, the SAC algorithm is introduced, enabling online tuning of PID gains through agent-environment interaction. Additionally, the strategy of combining SAC algorithm with PID controller mitigates concerns regarding interpretability and security often associated with DRL. In this study, stability analysis of the adaptive trajectory controller based on the SAC-PID algorithm is conducted. This paper horizontally compares the proposed method with traditional PID tuning methods, genetic algorithms (GA), and deep deterministic policy gradient (DDPG) algorithm to highlight the superiority of the SAC-PID approach. Finally, experiments in different scenarios are performed to compare generalization capabilities between DDPG and SAC algorithms. Results demonstrate that the proposed SAC-PID algorithm exhibits excellent stability properties, fast convergence speed, and strong generalization ability.https://hrcak.srce.hr/file/476944deep reinforcement learningunmanned surface vehiclesoft actor criticpid tuning
spellingShingle	Wei Guan Zhaoyong Xi Zhewen Cui Xianku Zhang Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID Brodogradnja deep reinforcement learning unmanned surface vehicle soft actor critic pid tuning
title	Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID
title_full	Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID
title_fullStr	Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID
title_full_unstemmed	Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID
title_short	Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID
title_sort	adaptive trajectory controller design for unmanned surface vehicles based on sac pid
topic	deep reinforcement learning unmanned surface vehicle soft actor critic pid tuning
url	https://hrcak.srce.hr/file/476944
work_keys_str_mv	AT weiguan adaptivetrajectorycontrollerdesignforunmannedsurfacevehiclesbasedonsacpid AT zhaoyongxi adaptivetrajectorycontrollerdesignforunmannedsurfacevehiclesbasedonsacpid AT zhewencui adaptivetrajectorycontrollerdesignforunmannedsurfacevehiclesbasedonsacpid AT xiankuzhang adaptivetrajectorycontrollerdesignforunmannedsurfacevehiclesbasedonsacpid

Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID

Similar Items