oxe-auge, 2025, cc-by-4.0
language_table_train_5000_10000_augmented
An augmented dataset of 5,000 episodes containing language-instructed robotic manipulation tasks with cross-embodiment augmentation for 8 different robot platforms. Includes visual observations, joint states, end-effector poses, and natural language instructions.
Downloads: 82
Episodes: 5000
Why This Matters for Physical AI
This dataset enables training of cross-embodiment robotic policies by providing augmented visual and proprioceptive observations paired with natural language instructions across multiple robot platforms, advancing generalization in language-conditioned robot learning.
Technical Profile
- Modalities: rgb, proprioception, language
- Robot Embodiments: Google Robot, Jaco, Kinova3, KUKA IIWA, Franka Panda, Sawyer, UR5e
- Environment: lab
- Task Types: manipulation, language-instructed manipulation
- Episodes: 5000
- Data Format: parquet
- Annotation Types: language_instructions
- License: cc-by-4.0
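Because the data ships as parquet, a first step after loading it is usually to group per-step rows into episodes and check how many episodes each embodiment has. The sketch below shows one way to do that in plain Python. The column names (`embodiment`, `episode_index`, `language_instruction`) and the toy rows are assumptions made for illustration; the card does not document the actual schema, so check the real column names before adapting it.

```python
from collections import defaultdict

# Hypothetical column names: the card does not document the parquet schema.
EPISODE_KEY = "episode_index"
EMBODIMENT_KEY = "embodiment"


def group_by_episode(rows):
    """Group per-step rows into lists keyed by (embodiment, episode index)."""
    episodes = defaultdict(list)
    for row in rows:
        episodes[(row[EMBODIMENT_KEY], row[EPISODE_KEY])].append(row)
    return dict(episodes)


def episodes_per_embodiment(rows):
    """Count distinct episodes for each embodiment."""
    counts = defaultdict(int)
    for embodiment, _episode in group_by_episode(rows):
        counts[embodiment] += 1
    return dict(counts)


# Toy rows standing in for real data (all values are made up).
# With pandas installed, real rows could instead come from
# pandas.read_parquet(path).to_dict("records").
toy_rows = [
    {"embodiment": "Franka Panda", "episode_index": 0,
     "language_instruction": "push the red block left"},
    {"embodiment": "Franka Panda", "episode_index": 0,
     "language_instruction": "push the red block left"},
    {"embodiment": "UR5e", "episode_index": 0,
     "language_instruction": "push the red block left"},
    {"embodiment": "UR5e", "episode_index": 1,
     "language_instruction": "move the blue cube to the corner"},
]

print(episodes_per_embodiment(toy_rows))
```

Keying episodes on the (embodiment, episode index) pair rather than the index alone keeps augmented copies of the same source episode separate, which matters if episode indices are reused across robot platforms.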