End-to-end GPU-accelerated Learning and Control for Robotics with Isaac Gym
at Robotics: Science and Systems, July 12, 2021
Required for participation in RSS workshops.
Specific to this workshop. Will be active starting on Monday, July 12 at 9:45am PT
In the last few years, reinforcement learning (RL) has become one of the most promising research areas in machine learning and has demonstrated great potential for solving sophisticated decision-making problems. Deep reinforcement learning has achieved superhuman performance in board games like Go, Poker, Dota 2 and Starcraft, as well as showing impressive results in robotics domains, including legged locomotion and dexterous manipulation.
There are two critical bottlenecks in the wider accessibility of RL technology in robotics: 1) enormous computational requirements and 2) limited simulation speed. These problems are especially challenging when learning long-horizon behaviors for robots with high degrees of freedom. To address these bottlenecks, NVIDIA has developed Isaac Gym, a high-performance robotics simulator that is completely free of charge. Its fully GPU-accelerated simulation and training pipeline can help lower the barrier for research, enabling the solution of tasks with a single GPU that were previously only possible on massive CPU clusters. For example, we show that training a quadruped ANYmal robot to walk in simulation takes under 30 min on a single A100 GPU and achieves successful policy deployment on a real robot. Moreover, we reproduce an OpenAI training setup with the Shadow Hand from the "Learning Dexterity" paper and are able to train it in less than 2 hours on an RTX 3090 GPU, compared to 40 hours on a CPU cluster with 6144 cores.
In this tutorial, we introduce the end-to-end GPU accelerated training pipeline in Isaac Gym, which allows researchers to overcome these key limitations and achieve a 100x-1000x training speed-up in continuous control tasks. Afterward, we perform a deep dive into Isaac Gym’s tensor API, upon which the GPU-accelerated training pipeline is built. Next, we demonstrate applications to various robotics domains through case studies, including manipulation with the Shadow Hand, operational space control with the Franka arm, and deformable-object grasping with the Franka gripper. Several of these case studies will be presented by guest speakers from university labs spanning robotics, machine learning, and simulation. Finally, we will host breakout sessions where participants can freely ask questions to organizers in their specific research area. The only prerequisites for our tutorial will be basic familiarity with Python, as well as fundamental robotics and machine learning concepts.
Organizers
Viktor Makoviychuk
NVIDIA Simulation Technology/Research
Gavriel State
NVIDIA Simulation Technology
Yashraj Narang
NVIDIA Research
Rika Antonova
Stanford
Yuke Zhu
UT Austin / NVIDIA Research
Josiah Wong
Stanford / NVIDIA Research
Ankur Handa
NVIDIA Research