Causal contextual bandits with one-shot data integration

We study a contextual bandit setting where the agent has access to causal side information, in addition to the ability to perform multiple targeted experiments corresponding to potentially different context-action pairs—simultaneously in one-shot within a budget. This new formalism provides a natura...

Full description

Saved in:
Bibliographic Details
Main Authors: Chandrasekar Subramanian, Balaraman Ravindran
Format: Article
Language:English
Published: Frontiers Media S.A. 2024-12-01
Series:Frontiers in Artificial Intelligence
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/frai.2024.1346700/full
Tags: Add Tag
No Tags, Be the first to tag this record!