Two-Stage Learning of CPG and Postural Reflex Toward Quadruped Locomotion on Uneven Terrain With Simple Reward

Locomotion control for quadruped robots has been extensively studied through various methodologies, including model-based, learning-based, and bio-inspired approaches. The integration of these methods is increasingly recognized as a powerful strategy for enhancing locomotion performance. Among them,...

Full description

Saved in:

Bibliographic Details
Main Authors:	Ryosei Seto, Guanda Li, Kyo Kutsuzawa, Dai Owaki, Mitsuhiro Hayashibe
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Central pattern generator deep reinforcement learning postural reflex quadrupedal robots
Online Access:	https://ieeexplore.ieee.org/document/11036717/
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Locomotion control for quadruped robots has been extensively studied through various methodologies, including model-based, learning-based, and bio-inspired approaches. The integration of these methods is increasingly recognized as a powerful strategy for enhancing locomotion performance. Among them, the Central Pattern Generator-Reinforcement Learning (CPG-RL) framework, which combines Central Pattern Generators (CPG) with reinforcement learning (RL), offers key advantages such as accelerated learning, improved Sim-to-Real transfer, and the ability to learn with a simplified reward function. However, its robustness on uneven terrains remains limited, as learning primarily focuses on tuning the parameters within the CPG. Specifically, because the foot placement is constrained to a pre-designed foot trajectory, the system is unable to stabilize its posture on uneven terrain. In biological systems, posture control is regulated by the brainstem and cerebellum, which send corrective signals to the spinal cord to modulate muscle activity and maintain balance in coordination with the CPG network. Inspired by this mechanism, this study introduces a postural reflex network that seamlessly integrates with the CPG network to enhance stable locomotion on uneven terrains. Notably, this improvement is achieved exclusively through piecewise sequential learning with a simplified reward function. The proposed approach expands the applicability of CPG-based locomotion under unstructured environments and contributes to the advancement of bio-inspired control strategies for quadrupedal locomotion. The proposed method progressively improves CPG-based locomotion on uneven terrain by learning the CPG parameters and the foot placement adjustments relative to the pre-designed trajectory in two-stage learning manner.
ISSN:	2169-3536

Two-Stage Learning of CPG and Postural Reflex Toward Quadruped Locomotion on Uneven Terrain With Simple Reward

Similar Items