108 Public Robotics Datasets

Search, preview, and download robotics data — manipulation, locomotion, tactile sensing, motion capture, and more. License-aware access: we respect every dataset's terms.

108 Total
26 Open License
2 CC-BY
22 Link-Only
42 Unknown

Popular Categories

Popular Tags

License
Category
Tag

Datasets for Robot Learning

Each dataset has a dedicated page with description, scale, access links, license badges, and citations.

DROID dataset capture workflow
RSS 2024
ResearchLink Only

DROID

76K trajectories, 350 hours, 86 tasks. In-the-wild manipulation from 50 collectors across 564 scenes.

View dataset → Official source ↗
BridgeData V2 engineering data setup
2023
MITDownload

BridgeData v2

60K trajectories, 24 environments, 13 manipulation skills. Low-cost WidowX robot. Natural language labels.

View dataset → Open in Platform →
Open X-Embodiment multi-robot data processing
Google DeepMind
CC-BYDownload

Open X-Embodiment

1M+ episodes, 22 robot types, 500+ skills. Unified RLDS format. RT-X models. 33 institutions. Per-subset licenses vary.

View dataset → Open in Platform →
ALOHA teleoperation platform image
Stanford / NVIDIA
Apache-2.0Download

ALOHA

Bimanual teleoperation. ALOHA-Cosmos-Policy, baseline datasets. HDF5, Hugging Face. Open hardware.

View dataset → Open in Platform →
LIBERO benchmark planning workflow
Benchmark
MITDownload

LIBERO

130 tasks, 65K demos. Lifelong learning benchmark. Spatial, object, goal suites. RoboSuite simulation.

View dataset → Open in Platform →
RoboNet multi-platform robotics scene
Stanford / Berkeley
MITDownload

RoboNet

15M frames, 7 robot platforms. Multi-robot transfer. Sawyer, Franka, Baxter, Fetch, WidowX.

View dataset → Official source ↗
OpenArm open-hardware bimanual teleoperation platform
Reazon Research
Apache-2.0Download

OpenArm

Open-hardware bimanual manipulation platform with reference teleoperation datasets. Reproducible build, low-cost teleop.

View dataset → Official source ↗
MimicGen demonstration synthesis pipeline
NVIDIA
MITDownload

MimicGen

50K+ demos synthesized from ~200 human demos. Task suites for robust imitation learning in simulation and real.

View dataset → Official source ↗
RoboMimic dataset and pipeline image
ARISE Initiative
MITDownload

RoboMimic

Framework + reference datasets for learning from human demonstrations. Simulation + real. MIT license.

View dataset → Official source ↗
RT-X cross-embodiment foundation model training
Google DeepMind
CC-BYDownload

RT-X

Cross-embodiment RT-1-X / RT-2-X policy training data, derived from Open X-Embodiment. Foundation-model scale.

View dataset → Official source ↗
LeRobot open robotics dataset ecosystem
Hugging Face
Apache-2.0Download

LeRobot

Standardized format + hub. DROID-100, ALOHA, SO-100. PyTorch, streaming. "ImageNet of robotics."

View dataset →

Models & Tools You Can Pair

Research-Ready Curation

We highlight scale, format, and access details needed for quick evaluation.

Cross-Stack Compatibility

Datasets are mapped to practical model and tool ecosystems.

Deployment Context

Dataset choices are linked with real robot execution constraints.

Scale-up Path

When open data is not enough, we support custom collection pipelines.

Need Custom Data?

We collect high-quality, learning-ready data for your specific tasks and hardware.