Two-Stage Learning of CPG and Postural Reflex Toward Quadruped Locomotion on Uneven Terrain With Simple Reward
Locomotion control for quadruped robots has been extensively studied through various methodologies, including model-based, learning-based, and bio-inspired approaches. The integration of these methods is increasingly recognized as a powerful strategy for enhancing locomotion performance. Among them,...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11036717/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Locomotion control for quadruped robots has been extensively studied through various methodologies, including model-based, learning-based, and bio-inspired approaches. The integration of these methods is increasingly recognized as a powerful strategy for enhancing locomotion performance. Among them, the Central Pattern Generator-Reinforcement Learning (CPG-RL) framework, which combines Central Pattern Generators (CPG) with reinforcement learning (RL), offers key advantages such as accelerated learning, improved Sim-to-Real transfer, and the ability to learn with a simplified reward function. However, its robustness on uneven terrains remains limited, as learning primarily focuses on tuning the parameters within the CPG. Specifically, because the foot placement is constrained to a pre-designed foot trajectory, the system is unable to stabilize its posture on uneven terrain. In biological systems, posture control is regulated by the brainstem and cerebellum, which send corrective signals to the spinal cord to modulate muscle activity and maintain balance in coordination with the CPG network. Inspired by this mechanism, this study introduces a postural reflex network that seamlessly integrates with the CPG network to enhance stable locomotion on uneven terrains. Notably, this improvement is achieved exclusively through piecewise sequential learning with a simplified reward function. The proposed approach expands the applicability of CPG-based locomotion under unstructured environments and contributes to the advancement of bio-inspired control strategies for quadrupedal locomotion. The proposed method progressively improves CPG-based locomotion on uneven terrain by learning the CPG parameters and the foot placement adjustments relative to the pre-designed trajectory in two-stage learning manner. |
|---|---|
| ISSN: | 2169-3536 |