Bias Adaptive Statistical Inference Learning Agents for Learning from Human Feedback
We present a novel technique for learning behaviors from a human-provided feedback signal that is distorted by systematic bias. Our technique, which we refer to as BASIL, models the feedback signal as being separable into a heuristic evaluation of the utility of an action and a bias value that is dr...
| Main Author: | Jonathan I Watson |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | LibraryPress@UF, 2021-04-01 |
| Series: | Proceedings of the International Florida Artificial Intelligence Research Society Conference |
| Online Access: | https://journals.flvc.org/FLAIRS/article/view/128471 |
Similar Items
- Goal-oriented autonomous decision-making for social robots via collaborative interactive inverse reinforcement learning approach
  by: Mingyue Luo, et al.
  Published: (2025-07-01)
- Stylistic study of uniformity of narrative language in the Tigers on the tenth day by Zakaria Tamer
  by: Ali Bayanlou
  Published: (2013-06-01)
- Case Report: Tetris Ball In The Left Atrium
  by: Uğur Altun, et al.
  Published: (2025-04-01)
- Influencing Reinforcement Learning through Natural Language Guidance
  by: Tasmia Tasrin, et al.
  Published: (2021-04-01)
- Robust Tracking Control of Underactuated UAVs Based on Zero-Sum Differential Games
  by: Yaning Guo, et al.
  Published: (2025-07-01)