oxe-auge2025cc-by-4.0
language_table_train_295000_300000_augmented
An augmented subset of the Language Table dataset containing 5,000 episodes with multi-robot cross-embodiment visual and proprioceptive data, augmented across 8 different robot platforms (Google Robot, Jaco, Kinova3, KUKA iiwa, Panda, Sawyer, UR5e) with natural language task instructions.
Downloads77
Episodes5000
Why This Matters for Physical AI
This dataset enables training of cross-embodiment robot policies by providing augmented visual and proprioceptive data across multiple robot platforms with consistent natural language task instructions, advancing scalable multi-robot learning.
Technical Profile
- Modalities
- rgbproprioceptionlanguage
- Robot Embodiments
- google_robotjacokinova3kuka_iiwapandasawyerur5e
- Action Space
- joint_positions
- Environment
- lab
- Task Types
- manipulation
- Episodes
- 5000
- Data Format
- parquet
- Annotation Types
- language_instructions
- License
- cc-by-4.0
Access
Need custom rgb data?
Claru builds purpose-built datasets for lab applications with dense human annotations and quality assurance.
Request a Sample Pack