A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation
Efficient task allocation remains a fundamental challenge in multi-agent systems, particularly under resource constraints and large-scale deployments. Classical methods, including market-based mechanisms, centralized optimization techniques, and game-theoretic strategies, have been widely applied to...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-06-01
|
| Series: | Systems |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2079-8954/13/6/453 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850164450705801216 |
|---|---|
| author | Zimo Zhu Chuanqiang Yu Junti Wang |
| author_facet | Zimo Zhu Chuanqiang Yu Junti Wang |
| author_sort | Zimo Zhu |
| collection | DOAJ |
| description | Efficient task allocation remains a fundamental challenge in multi-agent systems, particularly under resource constraints and large-scale deployments. Classical methods, including market-based mechanisms, centralized optimization techniques, and game-theoretic strategies, have been widely applied to address the multi-agent task allocation problem. While effective in small-to-medium-sized settings, these approaches often encounter limitations in terms of scalability, adaptability to dynamic environments, and computational efficiency as the problem size increases. To address these limitations, this study introduces a proximal policy optimization system augmented with a genetic algorithm (GAPPO) that integrates evolutionary search with deep reinforcement learning. GAPPO enables agents to develop energy-efficient task allocation strategies by perceiving environmental states and optimizing their actions through iterative policy updates. The genetic component promotes broader policy exploration beyond local optima, while the proximal policy optimization ensures update stability and sample efficiency. To evaluate the proposed GAPPO algorithm, extensive simulations are conducted across four scenarios, with the largest involving 50 tasks and 500 agents. The results demonstrate that GAPPO achieves superior performance compared to baseline methods, particularly in reducing task completion time. These findings highlight the algorithm’s robustness and efficiency in handling large-scale and computationally intensive coordination tasks. |
| format | Article |
| id | doaj-art-e1ca1219440246f5bd2a0a5fc39ecc65 |
| institution | OA Journals |
| issn | 2079-8954 |
| language | English |
| publishDate | 2025-06-01 |
| publisher | MDPI AG |
| record_format | Article |
| series | Systems |
| spelling | doaj-art-e1ca1219440246f5bd2a0a5fc39ecc652025-08-20T02:21:58ZengMDPI AGSystems2079-89542025-06-0113645310.3390/systems13060453A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task AllocationZimo Zhu0Chuanqiang Yu1Junti Wang2Department of Vehicle Engineering, Rocket Force University of Engineering, Xi’an 710025, ChinaDepartment of Vehicle Engineering, Rocket Force University of Engineering, Xi’an 710025, ChinaDepartment of Vehicle Engineering, Rocket Force University of Engineering, Xi’an 710025, ChinaEfficient task allocation remains a fundamental challenge in multi-agent systems, particularly under resource constraints and large-scale deployments. Classical methods, including market-based mechanisms, centralized optimization techniques, and game-theoretic strategies, have been widely applied to address the multi-agent task allocation problem. While effective in small-to-medium-sized settings, these approaches often encounter limitations in terms of scalability, adaptability to dynamic environments, and computational efficiency as the problem size increases. To address these limitations, this study introduces a proximal policy optimization system augmented with a genetic algorithm (GAPPO) that integrates evolutionary search with deep reinforcement learning. GAPPO enables agents to develop energy-efficient task allocation strategies by perceiving environmental states and optimizing their actions through iterative policy updates. The genetic component promotes broader policy exploration beyond local optima, while the proximal policy optimization ensures update stability and sample efficiency. To evaluate the proposed GAPPO algorithm, extensive simulations are conducted across four scenarios, with the largest involving 50 tasks and 500 agents. The results demonstrate that GAPPO achieves superior performance compared to baseline methods, particularly in reducing task completion time. These findings highlight the algorithm’s robustness and efficiency in handling large-scale and computationally intensive coordination tasks.https://www.mdpi.com/2079-8954/13/6/453genetic algorithmmulti-agent systemsproximal policy optimizationreinforcement learningtask allocation |
| spellingShingle | Zimo Zhu Chuanqiang Yu Junti Wang A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation Systems genetic algorithm multi-agent systems proximal policy optimization reinforcement learning task allocation |
| title | A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation |
| title_full | A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation |
| title_fullStr | A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation |
| title_full_unstemmed | A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation |
| title_short | A Hybrid Genetic Algorithm and Proximal Policy Optimization System for Efficient Multi-Agent Task Allocation |
| title_sort | hybrid genetic algorithm and proximal policy optimization system for efficient multi agent task allocation |
| topic | genetic algorithm multi-agent systems proximal policy optimization reinforcement learning task allocation |
| url | https://www.mdpi.com/2079-8954/13/6/453 |
| work_keys_str_mv | AT zimozhu ahybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation AT chuanqiangyu ahybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation AT juntiwang ahybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation AT zimozhu hybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation AT chuanqiangyu hybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation AT juntiwang hybridgeneticalgorithmandproximalpolicyoptimizationsystemforefficientmultiagenttaskallocation |