oxe-auge | 2025 | cc-by-4.0

language_table_train_80000_85000_augmented

An augmented subset of the Language Table dataset: 5,000 episodes of cross-embodiment visual and proprioceptive data for 8 different robot platforms, generated with domain randomization techniques to support scaling of cross-embodiment policy learning.

Downloads: 86
Episodes: 5000

Why This Matters for Physical AI

This dataset enables training of language-conditioned robotic policies that generalize across multiple robot embodiments through systematic visual and kinematic augmentation, advancing cross-embodiment learning for physical AI systems.

Technical Profile

Modalities: rgb, proprioception, language
Robot Embodiments: Google Robot, Jaco, Kinova3, KUKA IIWA, Franka Panda, Sawyer, UR5e
Environment: lab
Task Types: manipulation
Episodes: 5000
Data Format: parquet
Annotation Types: language_instructions
License: cc-by-4.0
Part of the OXE-AugE family
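
The dataset name and the episode count above can be cross-checked with a small script. This is a minimal sketch, not part of the dataset's tooling: it assumes the name follows a `<source>_<split>_<start>_<stop>_augmented` pattern in which `80000` and `85000` are source-episode indices (start inclusive, stop exclusive). The card does not state that convention, and `parse_shard_name` and `ShardRange` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class ShardRange(NamedTuple):
    """Fields recovered from a shard name (hypothetical helper type)."""
    source: str  # e.g. "language_table"
    split: str   # e.g. "train"
    start: int   # assumed first source-episode index (inclusive)
    stop: int    # assumed end of the source-episode range (exclusive)

    @property
    def expected_episodes(self) -> int:
        # Number of episodes implied by the assumed half-open range.
        return self.stop - self.start


# ASSUMPTION: names look like <source>_<split>_<start>_<stop>_augmented.
_NAME_RE = re.compile(
    r"^(?P<source>.+)_(?P<split>train|val|test)_(?P<start>\d+)_(?P<stop>\d+)_augmented$"
)


def parse_shard_name(name: str) -> ShardRange:
    """Split a shard name into its parts, or raise ValueError if it does not fit."""
    match = _NAME_RE.match(name)
    if match is None:
        raise ValueError(f"name does not match the assumed pattern: {name!r}")
    start, stop = int(match["start"]), int(match["stop"])
    if stop <= start:
        raise ValueError(f"empty or reversed range in {name!r}")
    return ShardRange(match["source"], match["split"], start, stop)


shard = parse_shard_name("language_table_train_80000_85000_augmented")
print(shard.source, shard.split, shard.expected_episodes)
```

Under that assumption the name implies 85000 - 80000 = 5000 episodes, which matches the episode count listed on this card.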

Related Datasets