Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | LibraryPress@UF, 2024-05-01 |
| Series: | Proceedings of the International Florida Artificial Intelligence Research Society Conference |
| Subjects: | |
| Online Access: | https://journals.flvc.org/FLAIRS/article/view/135567 |
| Summary: | We present a novel method for enhancing the sample efficiency of ensemble Q-learning. Our approach integrates multi-head self-attention into the ensembled Q-networks while bootstrapping the state-action pairs ingested by the ensemble. This not only improves performance over the original REDQ and its variant DroQ, thereby enhancing Q predictions, but also reduces both the average normalized bias and the standard deviation of normalized bias within the Q-function ensembles. Importantly, our method performs well even at a low update-to-data (UTD) ratio. Notably, the proposed method is straightforward to implement, requiring minimal modifications to the base model. |
| ISSN: | 2334-0754, 2334-0762 |
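
The summary describes the method only at a high level: an ensemble of Q-networks fed bootstrapped state-action samples, with multi-head self-attention letting ensemble members share information. The sketch below is a hypothetical PyTorch illustration of that idea, not the authors' implementation; the names (`AttentiveEnsembleQ`, `bootstrap_batch`), the ensemble size, hidden width, head count, and the choice to apply attention across ensemble members' features are all assumptions not stated in the abstract.

```python
# Hypothetical sketch (not the paper's code): an ensemble of Q-heads whose
# per-member features exchange information via multi-head self-attention,
# trained on bootstrapped (sampled-with-replacement) state-action minibatches.
import torch
import torch.nn as nn


class AttentiveEnsembleQ(nn.Module):
    def __init__(self, state_dim, action_dim, n_members=5, hidden_dim=128, n_heads=4):
        super().__init__()
        self.n_members = n_members
        # One feature encoder per ensemble member.
        self.encoders = nn.ModuleList([
            nn.Sequential(
                nn.Linear(state_dim + action_dim, hidden_dim), nn.ReLU(),
                nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
            )
            for _ in range(n_members)
        ])
        # Self-attention over the ensemble axis: members attend to each other.
        self.attn = nn.MultiheadAttention(hidden_dim, n_heads, batch_first=True)
        # One scalar Q-value head per member.
        self.q_heads = nn.ModuleList([nn.Linear(hidden_dim, 1) for _ in range(n_members)])

    def forward(self, state, action):
        x = torch.cat([state, action], dim=-1)                      # (B, S+A)
        feats = torch.stack([enc(x) for enc in self.encoders], 1)   # (B, M, H)
        attended, _ = self.attn(feats, feats, feats)                 # (B, M, H)
        qs = torch.cat(
            [head(attended[:, i]) for i, head in enumerate(self.q_heads)], dim=-1
        )                                                            # (B, M)
        return qs


def bootstrap_batch(states, actions, batch_size):
    """Sample a state-action minibatch with replacement (bootstrapping)."""
    idx = torch.randint(0, states.shape[0], (batch_size,))
    return states[idx], actions[idx]


# Toy usage with random data, just to show the shapes flowing through the model.
if __name__ == "__main__":
    states, actions = torch.randn(256, 17), torch.randn(256, 6)
    model = AttentiveEnsembleQ(state_dim=17, action_dim=6)
    s, a = bootstrap_batch(states, actions, batch_size=64)
    print(model(s, a).shape)  # torch.Size([64, 5]): one Q estimate per member
```

Applying attention across the member axis is one plausible reading of "multi-head self-attention in the ensembled Q-networks": each head's prediction can condition on the other members' representations, while sampling with replacement gives each update a bootstrapped view of the replay data.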