Learning the continuous-time optimal decision law from discrete-time rewards
The concept of reward is fundamental in reinforcement learning with a wide range of applications in natural and social sciences. Seeking an interpretable reward for decision-making that largely shapes the system's behavior has always been a challenge in reinforcement learning. In this work, we...
Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Science Press
2024-03-01
|
| Series: | National Science Open |
| Subjects: | |
| Online Access: | https://www.sciengine.com/doi/10.1360/nso/20230054 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|