- Embodiment diversityMultiple robots improve generalization but add alignment work.
- Language groundingInstruction consistency affects downstream conditioning.
- Standardized actionsPolicy training becomes easier when formats are explicit and reusable.
Foundation model training datasets for robotics
Foundation model datasets need breadth across tasks, embodiments, and action formats, but quality still matters more than simple scale.
This cluster helps ML teams compare whether public ecosystem datasets can support a foundation-model path or if they need domain-specific expansion.