AIBPO: Combine the Intrinsic Reward and Auxiliary Task for 3D Strategy Game

In recent years, deep reinforcement learning (DRL) achieves great success in many fields, especially in the field of games, such as AlphaGo, AlphaZero, and AlphaStar. However, due to the reward sparsity problem, the traditional DRL-based method shows limited performance in 3D games, which contain mu...

Full description

Saved in:
Bibliographic Details
Main Authors: Huale Li, Rui Cao, Xuan Wang, Xiaohan Hou, Tao Qian, Fengwei Jia, Jiajia Zhang, Shuhan Qi
Format: Article
Language:English
Published: Wiley 2021-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2021/6698231
Tags: Add Tag
No Tags, Be the first to tag this record!