Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients
This paper tackles the problem of mitigating catastrophic risk (risk with very low frequency but very high severity) in the context of a sequential decision-making process. This problem is particularly challenging due to the scarcity of observations in the far tail of the distribution of cu...
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | KeAi Communications Co., Ltd., 2025-12-01 |
| Series: | Journal of Finance and Data Science |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2405918825000170 |