Accelerated Transfer Learning for Cooperative Transportation Formation Change via SDPA-MAPPO (Scaled Dot Product Attention-Multi-Agent Proximal Policy Optimization)

A method for cooperative transportation, which required formation change in a traveling environment, is gaining interest. Deep reinforcement learning is used in formation changes for multi-robot cases. The MADDPG (Multi-Agent Deep Deterministic Policy Gradient) method is popularly used for recognize...

Full description

Saved in:
Bibliographic Details
Main Authors: Almira Budiyanto, Keisuke Azetsu, Nobutomo Matsunaga
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Automation
Subjects:
Online Access:https://www.mdpi.com/2673-4052/5/4/34
Tags: Add Tag
No Tags, Be the first to tag this record!