oxe-auge2025cc-by-4.0
language_table_train_80000_85000_augmented
An augmented subset of the Language Table dataset containing 5,000 episodes with cross-embodiment visual and proprioceptive data for 8 different robot platforms, augmented through domain randomization techniques to enable scaling of cross-embodiment policy learning.
Downloads86
Episodes5000
Why This Matters for Physical AI
This dataset enables training of language-conditioned robotic policies that generalize across multiple robot embodiments through systematic visual and kinematic augmentation, advancing cross-embodiment learning for physical AI systems.
Technical Profile
- Modalities
- rgbproprioceptionlanguage
- Robot Embodiments
- Google RobotJacoKinova3KUKA IIWAFranka PandaSawyerUR5e
- Environment
- lab
- Task Types
- manipulation
- Episodes
- 5000
- Data Format
- parquet
- Annotation Types
- language_instructions
- License
- cc-by-4.0
Access
Need custom rgb data?
Claru builds purpose-built datasets for lab applications with dense human annotations and quality assurance.
Request a Sample Pack