Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients

This paper tackles the problem of mitigating catastrophic risk (risk with very low frequency but very high severity) in the context of a sequential decision-making process. This problem is particularly challenging due to the scarcity of observations in the far tail of the distribution of cu...

Bibliographic Details
Main Authors: Parisa Davar, Frédéric Godin, Jose Garrido
Format: Article
Language: English
Published: KeAi Communications Co., Ltd. 2025-12-01
Series: Journal of Finance and Data Science
Subjects:
Online Access: http://www.sciencedirect.com/science/article/pii/S2405918825000170