HPRS: hierarchical potential-based reward shaping from task specifications

The automatic synthesis of policies for robotics systems through reinforcement learning relies upon, and is intimately guided by, a reward signal. Consequently, this signal should faithfully reflect the designer’s intentions, which are often expressed as a collection of high-level requirements. Seve...

Full description

Saved in:

Bibliographic Details
Main Authors:	Luigi Berducci, Edgar A. Aguilar, Dejan Ničković, Radu Grosu
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2025-02-01
Series:	Frontiers in Robotics and AI
Subjects:	robotics robot learning reinforcement learning reward shaping formal specifications
Online Access:	https://www.frontiersin.org/articles/10.3389/frobt.2024.1444188/full
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://www.frontiersin.org/articles/10.3389/frobt.2024.1444188/full

HPRS: hierarchical potential-based reward shaping from task specifications

Internet

Similar Items