oxe-auge2025cc-by-4.0
language_table_train_225000_230000_augmented
An augmented subset of the Language Table dataset containing 5,000 episodes with robot embodiment augmentation across 8 different robot platforms, featuring natural language instructions and multimodal observations.
Downloads66
Episodes5000
Why This Matters for Physical AI
This dataset enables cross-embodiment policy learning by providing augmented visual and proprioceptive data across multiple robot platforms with natural language instructions, advancing the ability to train generalizable robotic manipulation policies.
Technical Profile
- Modalities
- rgbproprioceptionlanguage
- Robot Embodiments
- google_robotjacokinova3kuka_iiwapandasawyerur5e
- Environment
- lab
- Task Types
- manipulation
- Episodes
- 5000
- Data Format
- parquet
- Annotation Types
- language_instructions
- License
- cc-by-4.0
Access
Need custom rgb data?
Claru builds purpose-built datasets for lab applications with dense human annotations and quality assurance.
Request a Sample Pack