Recent quadruped videos demonstrate agile jumping, stair climbing, and dynamic maneuvers that motivate this project’s performance goals for gait quality and robustness.
Many state-of-the-art locomotion policies are trained with deep reinforcement learning such as PPO in massively parallel GPU simulators like NVIDIA Isaac Gym and its modern successor Isaac Lab.
Note: Examples above are external or illustrative videos meant as inspiration; they showcase what robust locomotion policies and systems can achieve when trained and tuned extensively on Isaac Lab.
This is what an IsaacLab RL training looks like in the IsaacSim simulator: