Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning

We present a novel method aimed at enhancing the sample efficiency of ensemble Q learning. Our proposed approach integrates multi-head self-attention into the ensembled Q networks while bootstrapping the state-action pairs ingested by the ensemble. This not only results in performance improvements o...

Full description

Saved in:
Bibliographic Details
Main Authors: Muhammad Junaid Khan, Syed Hammad Ahmed, Gita Sukthankar
Format: Article
Language:English
Published: LibraryPress@UF 2024-05-01
Series:Proceedings of the International Florida Artificial Intelligence Research Society Conference
Subjects:
Online Access:https://journals.flvc.org/FLAIRS/article/view/135567
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We present a novel method aimed at enhancing the sample efficiency of ensemble Q learning. Our proposed approach integrates multi-head self-attention into the ensembled Q networks while bootstrapping the state-action pairs ingested by the ensemble. This not only results in performance improvements over the original REDQ and its variant DroQ, thereby enhancing Q predictions, but also effectively reduces both the average normalized bias and standard deviation of normalized bias within Q-function ensembles. Importantly, our method also performs well even in scenarios with a low update-to-data (UTD) ratio. Notably, the implementation of our proposed method is straightforward, requiring minimal modifications to the base model.
ISSN:2334-0754
2334-0762