LazyAct: Lazy actor with dynamic state skip based on constrained MDP.
Deep reinforcement learning has achieved significant success in complex decision-making tasks. However, the high computational cost of policies based on deep neural networks restricts their practical application. Specifically, each decision made by an agent requires a complete neural network computa...
Saved in:
Main Authors: | Hongjie Zhang, Zhenyu Chen, Hourui Deng, Chaosheng Feng |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2025-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0318778 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Vesicoureteral Reflux in the Child with Lazy Bladder Syndrome: The Infrequent Voider
by: Marco Grasso, et al.
Published: (2008-01-01) -
Tire Pressure Monitoring System Using Feature Fusion and Family of Lazy Classifiers
by: Arpit Pandey, et al.
Published: (2025-01-01) -
Tool Refactoring Otomatis untuk Menangani Lazy Class Code Smell dengan Pendekatan Software Metrics
by: Umi Sa'adah, et al.
Published: (2022-08-01) -
Evaluation of the Bond Strength of Self-Etching Adhesive Systems Containing HEMA and 10-MDP Monomers: Bond Strength of Adhesives Containing HEMA and 10-MDP
by: Roberta Pimentel de Oliveira, et al.
Published: (2022-01-01) -
Skipping Posterior Dynamic Transpedicular Stabilization for Distant Segment Degenerative Disease
by: Bilgehan Solmaz, et al.
Published: (2012-01-01)