A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation

Efficient task allocation remains a fundamental challenge in multi-agent systems, particularly under resource constraints and large-scale deployments. Classical methods, including market-based mechanisms, centralized optimization techniques, and game-theoretic strategies, have been widely applied to...

Full description

Saved in:
Bibliographic Details
Main Authors: Zimo Zhu, Chuanqiang Yu, Junti Wang
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:Systems
Subjects:
Online Access:https://www.mdpi.com/2079-8954/13/6/453
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850164450705801216
author Zimo Zhu
Chuanqiang Yu
Junti Wang
author_facet Zimo Zhu
Chuanqiang Yu
Junti Wang
author_sort Zimo Zhu
collection DOAJ
description Efficient task allocation remains a fundamental challenge in multi-agent systems, particularly under resource constraints and large-scale deployments. Classical methods, including market-based mechanisms, centralized optimization techniques, and game-theoretic strategies, have been widely applied to address the multi-agent task allocation problem. While effective in small-to-medium-sized settings, these approaches often encounter limitations in terms of scalability, adaptability to dynamic environments, and computational efficiency as the problem size increases. To address these limitations, this study introduces a proximal policy optimization system augmented with a genetic algorithm (GAPPO) that integrates evolutionary search with deep reinforcement learning. GAPPO enables agents to develop energy-efficient task allocation strategies by perceiving environmental states and optimizing their actions through iterative policy updates. The genetic component promotes broader policy exploration beyond local optima, while the proximal policy optimization ensures update stability and sample efficiency. To evaluate the proposed GAPPO algorithm, extensive simulations are conducted across four scenarios, with the largest involving 50 tasks and 500 agents. The results demonstrate that GAPPO achieves superior performance compared to baseline methods, particularly in reducing task completion time. These findings highlight the algorithm’s robustness and efficiency in handling large-scale and computationally intensive coordination tasks.
format Article
id doaj-art-e1ca1219440246f5bd2a0a5fc39ecc65
institution OA Journals
issn 2079-8954
language English
publishDate 2025-06-01
publisher MDPI AG
record_format Article
series Systems
spelling doaj-art-e1ca1219440246f5bd2a0a5fc39ecc652025-08-20T02:21:58ZengMDPI AGSystems2079-89542025-06-0113645310.3390/systems13060453A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task AllocationZimo Zhu0Chuanqiang Yu1Junti Wang2Department of Vehicle Engineering, Rocket Force University of Engineering, Xi’an 710025, ChinaDepartment of Vehicle Engineering, Rocket Force University of Engineering, Xi’an 710025, ChinaDepartment of Vehicle Engineering, Rocket Force University of Engineering, Xi’an 710025, ChinaEfficient task allocation remains a fundamental challenge in multi-agent systems, particularly under resource constraints and large-scale deployments. Classical methods, including market-based mechanisms, centralized optimization techniques, and game-theoretic strategies, have been widely applied to address the multi-agent task allocation problem. While effective in small-to-medium-sized settings, these approaches often encounter limitations in terms of scalability, adaptability to dynamic environments, and computational efficiency as the problem size increases. To address these limitations, this study introduces a proximal policy optimization system augmented with a genetic algorithm (GAPPO) that integrates evolutionary search with deep reinforcement learning. GAPPO enables agents to develop energy-efficient task allocation strategies by perceiving environmental states and optimizing their actions through iterative policy updates. The genetic component promotes broader policy exploration beyond local optima, while the proximal policy optimization ensures update stability and sample efficiency. To evaluate the proposed GAPPO algorithm, extensive simulations are conducted across four scenarios, with the largest involving 50 tasks and 500 agents. The results demonstrate that GAPPO achieves superior performance compared to baseline methods, particularly in reducing task completion time. These findings highlight the algorithm’s robustness and efficiency in handling large-scale and computationally intensive coordination tasks.https://www.mdpi.com/2079-8954/13/6/453genetic algorithmmulti-agent systemsproximal policy optimizationreinforcement learningtask allocation
spellingShingle Zimo Zhu
Chuanqiang Yu
Junti Wang
A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation
Systems
genetic algorithm
multi-agent systems
proximal policy optimization
reinforcement learning
task allocation
title A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation
title_full A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation
title_fullStr A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation
title_full_unstemmed A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation
title_short A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation
title_sort hybrid genetic algorithm and proximal policy optimization system for efficient multi agent task allocation
topic genetic algorithm
multi-agent systems
proximal policy optimization
reinforcement learning
task allocation
url https://www.mdpi.com/2079-8954/13/6/453
work_keys_str_mv AT zimozhu ahybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation
AT chuanqiangyu ahybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation
AT juntiwang ahybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation
AT zimozhu hybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation
AT chuanqiangyu hybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation
AT juntiwang hybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation