RoboCasa: Kitchen Manipulation Dataset

Large-scale simulation data for household robot policies. 536K+ downloads. MIT licensed and free to use.

MIT -- Open Parquet Simulation

Key Stats

MetricValue
Size~20 GB (pretrain); multiple variants total 100+ GB
Downloads536K+ (one of the most downloaded robotics datasets)
TasksKitchen and household manipulation (cooking, cleaning, organizing)
FormatParquet (pretrain); Parquet + MP4 (LeRobot variants)
ModalitiesRGB images, joint state, actions
LicenseMIT
Variants406 total RoboCasa datasets on HuggingFace

What is RoboCasa?

RoboCasa is a large-scale simulation environment and dataset suite designed specifically for household robotics. It provides diverse, realistic kitchen environments populated with everyday objects and task scenarios covering cooking, cleaning, and organization. The pretraining dataset (robocasa-pretrain) has become one of the most downloaded robotics datasets on HuggingFace with over 536K downloads.

The RoboCasa ecosystem includes several variants: the base pretrain dataset, NVIDIA's RoboCasa-Cosmos-Policy (2.98M rows formatted for Cosmos world model training), and LeRobot-compatible versions (robocasa_target_human_unified with 15M rows). There are 406 total RoboCasa-related datasets on HuggingFace.

RoboCasa is particularly valuable as a pretraining corpus for kitchen manipulation policies that will later be fine-tuned on real-world data.

Related datasets

From sim to real

Pretrain on RoboCasa, then fine-tune with real-world data from our collection service.