The Best Open-Source Robotics Stack for Research Labs (2026 Edition)
In 2026 you can run a credible robot-learning research program entirely on open-source software, with a small hardware bill and a single GPU. This is the stack we recommend — the pieces that fit together, where they do not, and how to compare against the closed NVIDIA alternative.
Published 2026-03-13 by the Silicon Valley Robotics Center research team.
TL;DR. The 2026 open-source robotics stack that actually fits together: LeRobot for data and policies, Robosuite or Isaac Lab for simulation, OpenVLA or Octo for foundation models, ROS 2 Jazzy for the middleware, DROID-style dataset conventions for interoperability, and GelSight-class optical tactile sensors for contact-rich research. The closed alternative is the NVIDIA Isaac stack plus proprietary VLAs. Choose open for reproducibility, cost, and portability. Choose closed only when you need integration deeper than your team can maintain on its own.
1. The stack at a glance
| Layer | Open-source default | Closed alternative | When closed wins |
|---|---|---|---|
| Middleware / runtime | ROS 2 Jazzy | Isaac ROS (NVIDIA) | Heavy reliance on NVIDIA Jetson perception |
| Data format & training | LeRobot | Proprietary (internal) | Closed labs, proprietary embodiments |
| Simulation | Robosuite, Isaac Lab (partly open) | Isaac Sim full stack | Large-scale RL, high-fidelity physics |
| Foundation policy | OpenVLA, Octo, pi0-open variants | Closed VLAs (various vendors) | State-of-the-art generalization on broad tasks |
| Dataset conventions | DROID-style, Open X-Embodiment | Proprietary dataset schemas | Only when stuck with vendor ecosystem |
| Tactile sensing | GelSight family (open designs) | Proprietary fingertip sensors | Productized integration on specific hands |
| Teleoperation | ALOHA-style, OpenArm, glove-based OSS | Vendor-locked VR/teleop SDKs | Consumer humanoid vendor integrations |
2. Middleware: ROS 2 Jazzy
ROS 2 Jazzy is now the default runtime for serious research robots. The tooling improvements over the last two years (better launch system, stabilized DDS implementations, improved Python ergonomics) have closed most of the remaining gaps with ROS 1. Practical implications for labs:
- Most modern research arms ship with ROS 2 drivers. If a vendor still only supports ROS 1, it is a yellow flag.
- LeRobot, Robosuite, and most teleoperation packages now have ROS 2 bridges.
- Multi-robot setups are materially easier on ROS 2 than ROS 1.
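To make the bringup story concrete, here is what a minimal ROS 2 launch file for a lab robot looks like; the package and executable names below are placeholders for whatever your vendor driver actually ships, not real packages:

```python
# bringup.launch.py — ROS 2 Jazzy launch sketch. The package and
# executable names are placeholders, not real driver packages.
from launch import LaunchDescription
from launch_ros.actions import Node


def generate_launch_description():
    return LaunchDescription([
        Node(
            package="arm_driver",            # placeholder: vendor arm driver
            executable="driver_node",
            parameters=[{"robot_ip": "192.168.1.10"}],
        ),
        Node(
            package="camera_driver",         # placeholder: camera node
            executable="camera_node",
            parameters=[{"fps": 30}],
        ),
    ])
```

Keeping the whole bringup in one launch file (rather than a shell script of `ros2 run` calls) is what makes multi-robot and multi-machine setups manageable later.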
The closed alternative, NVIDIA's Isaac ROS, is a set of GPU-accelerated perception and control packages on top of ROS 2. It is excellent if your pipeline leans heavily on Jetson inference; it is overkill and vendor-coupled otherwise.
3. Data and policy layer: LeRobot
LeRobot has become the default choice for a reason. It is opinionated enough to enforce a coherent dataset format, general enough to host most current policy architectures, and active enough that the documentation is actually current. Our LeRobot guide and LeRobot framework getting started cover the hands-on side.
What LeRobot does well: dataset conversion from common formats (ALOHA, DROID, OpenArm), shared training loop for ACT/Diffusion Policy/VQ-BeT/VLA-adapter architectures, checkpoint and config provenance. What it does less well: robust online serving on resource-constrained robot-side compute, multi-embodiment policy abstractions. Expect both to improve over 2026.
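To make the dataset-format point concrete, here is a minimal sketch of the flat, key-per-signal frame layout that LeRobot-style datasets use. The key names follow the common convention but should be treated as our assumption, not an API reference:

```python
# Sketch of a LeRobot-style per-frame record. Key names such as
# "observation.state" follow the common convention; treat them as
# assumptions, not a spec.
def make_frame(episode_index, frame_index, state, action, fps=30):
    """Build one frame of an episode in a flat, key-per-signal layout."""
    return {
        "episode_index": episode_index,
        "frame_index": frame_index,
        "timestamp": frame_index / fps,         # seconds since episode start
        "observation.state": list(state),       # proprioception, e.g. joints
        "action": list(action),                 # commanded joint targets
    }


# A three-frame toy episode for a 7-DoF arm.
episode = [make_frame(0, i, [0.0] * 7, [0.1] * 7) for i in range(3)]
```

The value of the convention is that every converter (ALOHA, DROID, OpenArm) targets this one shape, so the training loop never needs to know where the data came from.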
4. Simulation: Robosuite or Isaac Lab
Simulation remains a second-class citizen in most real-world policy pipelines, but it is genuinely first-class for RL research and for policy pre-screening. The two open options:
Robosuite
MuJoCo-based, Python-native, easy to author new scenes in. Best for imitation-learning research, sim-to-real experiments where physics fidelity is secondary, and quick iteration. Mature, stable, widely used.
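Robosuite environments expose the familiar reset/step rollout interface. The sketch below uses a stdlib stand-in (`DummyEnv` is a placeholder, not a robosuite class) to show the interaction pattern your evaluation code will follow:

```python
import random


class DummyEnv:
    """Placeholder with the reset/step interface robosuite-style
    environments expose; not a real robosuite class."""

    def __init__(self, horizon=10):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return {"robot0_proprio": [0.0] * 7}

    def step(self, action):
        self.t += 1
        obs = {"robot0_proprio": action}
        reward = -sum(a * a for a in action)   # toy quadratic action cost
        done = self.t >= self.horizon
        return obs, reward, done, {}


env = DummyEnv()
obs = env.reset()
done, steps = False, 0
while not done:
    action = [random.uniform(-0.1, 0.1) for _ in range(7)]
    obs, reward, done, info = env.step(action)
    steps += 1
```

Because the loop is this simple, swapping a random policy for a trained one is a one-line change, which is exactly why sim is useful for pre-screening checkpoints.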
Isaac Lab
GPU-accelerated, based on Isaac Sim, supports thousands of environments in parallel. The right choice for reinforcement learning with large rollouts, domain randomization at scale, and any research where throughput is the bottleneck. Our Isaac Lab getting started covers setup and common pitfalls. Partially open — the surrounding Isaac Sim ecosystem has closed components, and you are coupled to NVIDIA hardware.
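Domain randomization at scale is mostly a sampling problem: draw thousands of physics configurations per batch of parallel environments. A minimal stdlib sketch of the sampling step (parameter names and spans are illustrative, not Isaac Lab's API):

```python
import random


def randomize_physics(base, spans):
    """Sample one domain-randomized physics configuration: each parameter
    is drawn uniformly within a fractional span around its nominal value.
    Names and spans here are illustrative, not Isaac Lab's API."""
    return {
        k: base[k] * random.uniform(1 - spans.get(k, 0.0), 1 + spans.get(k, 0.0))
        for k in base
    }


# One configuration per parallel environment; GPU sims make thousands cheap.
cfgs = [
    randomize_physics({"friction": 1.0, "mass": 0.5},
                      {"friction": 0.3, "mass": 0.1})
    for _ in range(1000)
]
```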
Our sim-to-real notes are in sim-to-real transfer and practitioner sim-to-real tips.
5. Foundation policy: OpenVLA, Octo, and friends
The foundation-policy layer is where the open ecosystem has made the most dramatic strides since 2024. The choice today:
- Octo: smaller, faster to fine-tune, forgiving of small datasets and heterogeneous embodiments. Default for solo researchers and small labs.
- OpenVLA: larger, better pre-trained coverage, supports language conditioning out of the box. Default for teams with >500 demos and language-conditioned tasks.
- Diffusion Policy: not a VLA but the default for contact-rich manipulation with a few hundred demos. Often complementary to Octo/OpenVLA for specific task types.
- ACT: action-chunking transformer. The bimanual workhorse. See ACT policy explained and ACT vs Diffusion Policy.
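ACT and the VLAs above all emit chunks of future actions, so the executor has to blend overlapping predictions at each timestep. A minimal temporal-ensembling sketch; the exponential weighting and the decay rate `m` are illustrative choices, not the exact ACT recipe:

```python
import math


def temporal_ensemble(chunks, t, m=0.1):
    """Blend every still-valid prediction for timestep t. `chunks` maps
    the timestep a chunk was predicted at -> the list of actions it
    predicted. Weights decay exponentially with prediction age; the
    direction and rate of decay are illustrative choices."""
    acc, total = 0.0, 0.0
    for t0, chunk in chunks.items():
        idx = t - t0                       # offset of t inside this chunk
        if 0 <= idx < len(chunk):
            w = math.exp(-m * (t - t0))    # older predictions weigh less
            acc += w * chunk[idx]
            total += w
    return acc / total


# Two overlapping chunks, predicted at t=0 and t=2; blend them at t=2.
chunks = {0: [1.0, 1.0, 1.0, 1.0], 2: [2.0, 2.0]}
blended = temporal_ensemble(chunks, 2)
```

Blending this way smooths the discontinuities you would otherwise get every time a new chunk replaces the old one mid-trajectory.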
The broader /vla-models/ catalog is kept current. Our compute-side guidance is in scaling VLA training on a budget.
6. Dataset conventions: DROID and Open X-Embodiment
Interoperability across datasets is the multiplier that makes cross-embodiment pre-training possible. Open X-Embodiment (OXE) is the reference dataset collection; see our OXE explainer. DROID-style conventions (multi-camera, rich metadata, language annotations) have become the de facto standard for new data releases.
Practical guidance: record in LeRobot's format natively or provide a clean converter to it. Make your data consumable by OXE-style pre-training runs even if you never contribute upstream. Your future self will thank you when you want to fine-tune the next-generation VLA. Our curated datasets at /datasets/ and data services at /data-services/ are designed around these conventions.
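A cheap way to enforce these conventions is a validator that runs at record time. A sketch with hypothetical field names standing in for your schema, capturing the DROID-style requirements (multi-camera, language annotation, rich metadata):

```python
# Hypothetical field names standing in for a real episode schema.
REQUIRED = {"language_instruction", "camera_views", "robot_state", "actions"}


def validate_episode(ep):
    """Return (ok, problems) for one recorded episode dict."""
    problems = sorted(REQUIRED - ep.keys())
    if not problems and len(ep["camera_views"]) < 2:
        problems.append("DROID-style data expects >= 2 camera views")
    return (not problems, problems)


ok, problems = validate_episode({
    "language_instruction": "pick up the red block",
    "camera_views": ["wrist", "overhead"],
    "robot_state": [[0.0] * 7],
    "actions": [[0.0] * 7],
})
```

Rejecting a bad episode at record time costs seconds; discovering it during a pre-training run costs a day.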
7. Tactile sensing: GelSight and the long tail
Optical tactile sensors in the GelSight family (GelSight, GelSight Mini, DIGIT) have become cheap enough that serious labs can standardize on them for contact-rich research. The open designs, open calibration pipelines, and growing body of published learning recipes make this one of the more accessible research directions. Our contact forces essay frames the why; our force-torque overview covers the adjacent modality. The best dexterous robot hands piece covers hands that integrate tactile.
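The standard first step in a GelSight-style pipeline is comparing each frame against a no-contact reference image of the gel. A minimal per-pixel sketch; the threshold value is illustrative, and real pipelines work on RGB gradients and calibrated depth, not raw scalar intensities:

```python
def contact_mask(frame, reference, threshold=12):
    """Flag pixels that deviate from the no-contact reference image by
    more than `threshold` intensity levels. Inputs are 2-D lists of ints;
    the threshold is an illustrative number, not a calibrated value."""
    return [
        [abs(p - r) > threshold for p, r in zip(row, ref_row)]
        for row, ref_row in zip(frame, reference)
    ]


ref = [[100, 100], [100, 100]]
frame = [[100, 130], [100, 100]]   # one pressed pixel
mask = contact_mask(frame, ref)
```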
8. Teleoperation and data collection
Teleoperation is the feedstock for everything above. Open-source-friendly options in 2026:
- ALOHA-style leader-follower. Best for bimanual tabletop. See the ALOHA guide and Mobile ALOHA setup.
- OpenArm-based setups. Open bill of materials, modern software stack, growing community. See OpenArm setup and the OpenArm vs Franka comparison.
- VR teleoperation. For long-reach and mobile; more commercial than purely open but several open packages integrate cleanly. See VR teleoperation companies.
- Glove-based dexterous capture. The most research-forward path; hardware diversity is still high.
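At the control layer, all of these reduce to mirroring a leader device's joints onto a follower with a rate limit. A minimal per-tick sketch; the 0.05 rad-per-tick limit is an illustrative safety number, not a tuned value:

```python
def mirror_step(leader_q, follower_q, max_delta=0.05):
    """One teleop control tick: move each follower joint toward the
    corresponding leader joint, but never by more than max_delta per
    tick. The 0.05 rad limit is illustrative, not a tuned value."""
    targets = []
    for ql, qf in zip(leader_q, follower_q):
        step = max(-max_delta, min(max_delta, ql - qf))
        targets.append(qf + step)
    return targets


# Follower lags the leader by a full radian on joint 0; it closes the
# gap at most 0.05 rad per tick rather than jumping.
targets = mirror_step([1.0, 0.0], [0.0, 0.0])
```

The rate limit is what keeps a glitchy leader reading from slamming the follower arm, which is why it belongs at this layer rather than in the policy.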
The quality discipline on top is in our teleoperation data quality checklist.
9. The closed alternative: NVIDIA Isaac + proprietary VLAs
For labs and enterprises that prefer an integrated commercial stack, NVIDIA's Isaac family (Isaac Sim, Isaac ROS, Isaac Manipulator, Isaac Perceptor) plus a proprietary VLA is a credible alternative. It is not open-source in any meaningful sense, but it is internally consistent, well-supported, and well-documented. You pay in licensing, in vendor coupling, and in the lost ability to reproduce your pipeline on non-NVIDIA hardware.
Our honest take: for pure research, open wins. For enterprise pilots where integration burden is the bottleneck and compute is already NVIDIA, closed is defensible. The middle case — a research lab with a commercial funder — usually ends up open by default and closed-for-specific-components over time.
10. A concrete recommended starting configuration
If you are starting a new lab this quarter, here is what we would stand up:
- Hardware: one bimanual platform (ALOHA or equivalent) and one single-arm research arm (OpenArm or Franka). Source options in the SVRC store and compare in the compare tool.
- Compute: one workstation with an A6000 or L40S (48GB) and cloud spot access for larger runs.
- Middleware: ROS 2 Jazzy.
- Data/training: LeRobot with the LeRobot dataset v2 format. Curate your own dataset for a first task (~300 demos) before touching any model.
- Foundation models: Octo for initial fine-tunes; OpenVLA once dataset exceeds ~500 episodes. Diffusion Policy for contact-rich side projects.
- Simulation: Robosuite for week one. Isaac Lab only if you have an RL-heavy research question.
- Tactile: one GelSight-family sensor per research arm.
- Docs and tutorials: our buyer guides and tutorials cover the onboarding. For leasing options rather than capex, see SVRC leasing.
11. Anti-patterns to avoid
- Building on one VLA forever. Design your data and serving pipeline to be portable between at least two. The field is moving.
- Rolling your own dataset format. Use LeRobot's. Every hour you spend building your own is an hour lost on real research.
- Premature distributed training. A single H100 is plenty for most fine-tunes. Multi-node introduces failure modes that will eat your quarter.
- Ignoring the data pipeline. The best policy architecture will not rescue a bad dataset. See our data quality checklist.
- Skipping ROS 2. Tempting in week one, and always a mistake at scale.
12. Closing note
The 2026 open-source robotics stack is not perfect, but it is coherent, reproducible, and affordable in a way it simply was not two years ago. The pieces fit. A small lab with two researchers and a modest hardware budget can now compete with well-funded industrial groups on any narrow-task benchmark, and can publish results that other labs can reproduce without signing any licensing agreements. That is a genuinely new state of affairs.
For the quarterly state of the field, see our state of robot learning Q1 2026. For supply-chain realities on the hardware side, see humanoid supply chain reality check. For the research agenda this stack unlocks, see why bimanual is the next frontier. And if you want help sizing a program, get in touch.
13. Frequently asked questions
Is the open stack really competitive with the closed one for research?
For reproducible narrow-task research, yes. Open VLAs plus LeRobot plus a decent teleop pipeline will produce results that are competitive with anything published by closed labs on single-task benchmarks. The gap, where it exists, is in cross-task and cross-embodiment generalization, which is more data-bound than architecture-bound.
Do we need Isaac Sim if we use Robosuite?
Not unless you need GPU-scale parallelism or high-fidelity physics for RL. Robosuite is sufficient for most imitation-learning research and for sim-to-real prototyping. Isaac Lab is the right tool when you are running thousands of parallel environments, not one or two.
What about MuJoCo MPC and model-based control?
MuJoCo MPC is a strong open-source option for model-based control research, especially for locomotion and whole-body tasks on humanoids. It pairs naturally with Robosuite for simulated evaluation and with ROS 2 for hardware integration. If your research agenda emphasizes control-theoretic work over learning, it deserves a serious look alongside the main stack above.
Can we mix open-source policies with closed hardware?
Yes. Most vendor SDKs expose joint-level interfaces that LeRobot-trained policies can drive. The friction is typically at the calibration and observation-sync layer, not the policy layer itself. Plan for a week of integration per new platform.
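The observation-sync friction mentioned above usually comes down to aligning streams recorded at different rates (cameras at 30 Hz, joints at 500 Hz). A nearest-timestamp lookup sketch; the tolerance value is an assumption standing in for your sensor's frame period:

```python
import bisect


def nearest_obs(timestamps, values, t, tol=0.02):
    """Return the stored observation closest in time to query t, or None
    if nothing lies within tol seconds (dropped-frame guard). Assumes
    `timestamps` is sorted ascending; tol is an illustrative value."""
    i = bisect.bisect_left(timestamps, t)
    candidates = [j for j in (i - 1, i) if 0 <= j < len(timestamps)]
    best = min(candidates, key=lambda j: abs(timestamps[j] - t))
    if abs(timestamps[best] - t) > tol:
        return None
    return values[best]


ts = [0.0, 0.033, 0.066]                       # ~30 Hz camera stream
matched = nearest_obs(ts, ["a", "b", "c"], 0.030)   # within tolerance
dropped = nearest_obs(ts, ["a", "b", "c"], 0.200)   # no frame nearby
```

Getting this layer right, and logging when it returns None, is most of the "week of integration per new platform."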
How fast does this stack move?
Fast. We refresh our open-stack recommendations quarterly — see our quarterly state review for the current snapshot. Pick a commit, pin your dependencies, and document your versions; the reproducibility discipline matters more when the upstream is moving.