NVIDIA GR00T Teleop Datasets
Humanoid robot teleoperation data for the GR00T foundation model. Real + sim, multiple embodiments, 13M+ total rows.
Dataset Variants
| Variant | Robot | Rows | Size | Downloads |
|---|---|---|---|---|
| GR00T Teleop GR1 | NVIDIA GR1 humanoid | 7.55M | ~50 GB | 65.2K |
| GR00T Teleop Sim | Simulated humanoid | 5.82M | ~40 GB | 6.5K |
| GR00T Teleop G1 | Unitree G1 humanoid | 124K | ~2 GB | 526 |
| GR00T X-Embodiment Sim | Multiple (sim) | 345K | ~5 GB | -- |
| Manipulation Kitchen | Robot arm | 405K | ~5 GB | 1.5K |
| Open-H-Embodiment | Humanoid (cross) | -- | ~0.1 GB | 36.9K |
| GraspGen | Hand/gripper | 25.5K | ~3 GB | 1K |
What is NVIDIA GR00T?
GR00T (Generalist Robot 00 Technology) is NVIDIA's foundation model initiative for humanoid robots. The GR00T teleop datasets are the real-world and simulated teleoperation recordings used to train these models. The GR1 variant alone contains 7.55 million rows of humanoid teleoperation data, making it the most downloaded NVIDIA robotics dataset with 65.2K downloads.
The collection spans multiple embodiments (NVIDIA GR1, Unitree G1, simulated humanoids) and tasks (manipulation, kitchen interactions, grasping), providing a cross-embodiment foundation for humanoid policy learning. NVIDIA also released Cosmos world model integrations with RoboCasa and LIBERO environments, bridging the gap between simulation and real-world deployment.
Access
License note: NVIDIA Open Model License allows redistribution with attribution but restricts use for competing model development. Review full terms before commercial use.
Related datasets
- Unitree G1 Datasets -- more G1 humanoid data from Unitree
- Open X-Embodiment -- cross-embodiment pretraining corpus
- RoboCasa -- kitchen manipulation (Cosmos integration available)
- AMASS -- motion capture data for humanoid animation