Manipulation Datasets

The largest category in robot learning data. Curated open-source and custom manipulation datasets spanning tabletop pick-and-place, contact-rich assembly, deformable objects, and tool use — everything a manipulation policy needs to generalize.

Key Open-Source Manipulation Datasets

DatasetEpisodesRobot(s)Format
DROID76K trajectoriesFranka FR3RLDS, HDF5
BridgeData v260K+WidowX-250RLDS, HDF5
Open X-Embodiment1M+22 robot typesRLDS
RoboMimicVaries per taskFranka (sim + real)HDF5
MimicGen50K+ (generated)Franka (sim)HDF5
LIBERO65K demos, 130 tasksRoboSuite simHDF5
OpenArm DatasetsGrowing collectionOpenArmLeRobot, HDF5

Choosing the Right Dataset

For VLA pretraining, start with Open X-Embodiment or DROID — they cover the broadest range of tasks, environments, and embodiments. For single-task fine-tuning, BridgeData v2 and RoboMimic provide cleaner, more focused task suites. For data augmentation, MimicGen can expand a small seed set by 100x through automated trajectory generation.

If you need data on a specific robot or task that does not exist in any open dataset, SVRC can collect custom manipulation datasets using our teleoperation infrastructure. We operate OpenArm, Franka, UR, and xArm collection stations.

Format Compatibility

Most modern manipulation datasets are available in multiple formats. Use our dataset comparison tools to understand schema differences. The Fearless Data Platform can convert between HDF5, RLDS, and LeRobot formats automatically.

Need custom manipulation data?

Commission teleoperation data on your target robot and task. We deliver in your format of choice.