Comparative Evaluation of Mean Cumulative Regret in Multi-Armed Bandit Algorithms: ETC, UCB, Asymptotically Optimal UCB, and TS

Comparative Evaluation of Mean Cumulative Regret in Multi-Armed Bandit Algorithms: ETC, UCB, Asymptotically Optimal UCB, and TS

This research provides insights into how to address short-term and long-term decision-making in different kinds of the Multi-Armed Bandit (MAB) problem, a classic problem in decision-making under uncertainty. In this study, four algorithms - Explore-Then-Commit (ETC), the Upper Confidence Bound (UCB...

Full description

Saved in:

Bibliographic Details
Main Author:	Lei Yicong
Format:	Article
Language:	English
Published:	EDP Sciences 2025-01-01
Series:	ITM Web of Conferences
Online Access:	https://www.itm-conferences.org/articles/itmconf/pdf/2025/04/itmconf_iwadi2024_01026.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Nonstationary Stochastic Bandits: UCB Policies and Minimax Regret
by: Lai Wei, et al.
Published: (2024-01-01)

Numerical analysis of springback with experimental validation using UCB test
by: Rogério Lopes, et al.
Published: (2024-01-01)

YOLOv8-UCB: Visual Detection of Pouch Battery Using Improved YOLOv8
by: Hao Hao, et al.
Published: (2024-01-01)

MSC-EVs and UCB-EVs promote skin wound healing and spatial transcriptome analysis
by: Ruonan Li, et al.
Published: (2025-02-01)

Client aware adaptive federated learning using UCB-based reinforcement for people re-identification
by: Dinah Waref, et al.
Published: (2025-05-01)

Efficient Chlorophyll Prediction and Sampling in the Sea: A Real-Time Approach With UCB-Based Path Planning
by: Perihan Karakose, et al.
Published: (2025-01-01)

Multi-Dimensional Arms for Combinatorial Multi-Armed Bandit
by: Qi Li, et al.
Published: (2025-01-01)

Optimizing Data Filtering in Multi-Armed Bandit Algorithms for Reinforcement Learning
by: Zhang Shengshi
Published: (2025-01-01)

Research on the Multi-Armed Bandit Algorithm in Path Planning for Autonomous Vehicles
by: Li Jingyu
Published: (2025-01-01)

A Review of Multi-Armed Bandit Algorithms in Player Modeling and Game Design
by: Zhan Xizhi
Published: (2025-01-01)

Fair Probabilistic Multi-Armed Bandit With Applications to Network Optimization
by: Zhiwu Guo, et al.
Published: (2024-01-01)

Adaptive Noise Exploration for Neural Contextual Multi-Armed Bandits
by: Chi Wang, et al.
Published: (2025-01-01)

Cost-Efficient Distributed Learning via Combinatorial Multi-Armed Bandits
by: Maximilian Egger, et al.
Published: (2025-05-01)

Combining Multiple Strategies for Multiarmed Bandit Problems and Asymptotic Optimality
by: Hyeong Soo Chang, et al.
Published: (2015-01-01)

Asymptotic symmetries in the $TsT/T\bar{T}$ correspondence
by: Zhengyuan Du, Wen-Xin Lai, Kangning Liu, Wei Song
Published: (2025-02-01)

Multi armed bandit based resource allocation in Near Memory Processing architectures
by: Shubhang Pandey, et al.
Published: (2025-12-01)

Mating with Multi-Armed Bandits: Reinforcement Learning Models of Human Mate Search
by: Daniel Conroy-Beam
Published: (2024-08-01)

Modified Index Policies for Multi-Armed Bandits with Network-like Markovian Dependencies
by: Abdalaziz Sawwan, et al.
Published: (2025-01-01)

Gaussian Process with Vine Copula-Based Context Modeling for Contextual Multi-Armed Bandits
by: Jong-Min Kim
Published: (2025-06-01)

A multi-armed bandits empowered transmission scheme for IRS-assisted MISO system
by: SONG Yunchao, et al.
Published: (2025-03-01)

Client Selection for Generalization in Accelerated Federated Learning: A Multi-Armed Bandit Approach
by: Dan Ben Ami, et al.
Published: (2025-01-01)

Designing digital health interventions with causal inference and multi-armed bandits: a review
by: Radoslava Švihrová, et al.
Published: (2025-06-01)

Cooperate or Not Cooperate: Transfer Learning With Multi-Armed Bandit for Spatial Reuse in Wi-Fi
by: Pedro Enrique Iturria-Rivera, et al.
Published: (2024-01-01)

Reducing Computational Time in Pixel-Based Path Planning for GMA-DED by Using Multi-Armed Bandit Reinforcement Learning Algorithm
by: Rafael P. Ferreira, et al.
Published: (2025-03-01)

Active Inference-Driven Multi-Armed Bandits: Superior Performance through Dynamic Correlation Adjustments
by: Lin Xiaoqi
Published: (2025-01-01)

Multi-Armed Bandit Approaches for Location Planning with Dynamic Relief Supplies Allocation Under Disaster Uncertainty
by: Jun Liang, et al.
Published: (2024-12-01)

A Hybrid Proactive Caching System in Vehicular Networks Based on Contextual Multi-Armed Bandit Learning
by: Qiao Wang, et al.
Published: (2023-01-01)

An asymptotic expansion of the expected regret risk of classification under double inverse sampling scheme
by: Kęstutis Dučinskas
Published: (1997-12-01)

Presentació: Un certain Calvino, etc... etc...
Published: (2025-04-01)

Adaptive PPO With Multi-Armed Bandit Clipping and Meta-Control for Robust Power Grid Operation Under Adversarial Attacks
by: Mohamed Massaoudi, et al.
Published: (2025-01-01)

BiDir-GRCO: A Bidirectional General Reaction Conditions Optimization Framework Integrating Multi-Armed Bandit and Regression Model
by: Quan Jiang, et al.
Published: (2025-08-01)

AI-Driven Nudge Optimization: Integrating Two-Tower Networks and Multi-Armed Bandit With Behavioral Economics for Digital Banking Campaign
by: Idha Kristiana, et al.
Published: (2025-01-01)

Optimistic Algorithms for Safe Linear Bandits Under General Constraints
by: Spencer Hutchinson, et al.
Published: (2025-01-01)

Deciphering algorithmic collusion: Insights from bandit algorithms and implications for antitrust enforcement
by: Frédéric Marty, et al.
Published: (2025-11-01)

tDCS over VLPFC modulates the exploit-explore tradeoff in a two-armed bandit task
by: Bart Krekelberg, et al.
Published: (2025-01-01)

Bandit Algorithms for Efficient Toxicity Detection in Competitive Online Video Games
by: Jacob Morrier, et al.
Published: (2025-01-01)

Periods, Capitalized Words, etc.
by: Andrei Mikheev
Published: (2021-03-01)

Divided Agency, Manipulation, and Regret
by: Jonathan D Payton
Published: (2024-11-01)

Subprime Risk and Insurance with Regret
by: M. A. Petersen, et al.
Published: (2010-01-01)

Regret Averse Opinion Aggregation
by: Lee Elkin
Published: (2021-12-01)