arxiv.org

Real-World Humanoid Locomotion with Reinforcement Learning

Natural walking

Omnidirectional walking(전방향 보행)

Dynamic Arm Swing & Fast Walking

In-context adaptation

Emergent gait changes based on terrain

Emergent recovery from foot-trapping

Policy learning

Model architecture.

Teacher State-Policy Supervision

Joint optimization with reinforcement learning.