oxe-auge2025cc-by-4.0

language_table_train_5000_10000_augmented

An augmented dataset of 5,000 episodes containing language-instructed robotic manipulation tasks with cross-embodiment augmentation for 8 different robot platforms. Includes visual observations, joint states, end-effector poses, and natural language instructions.

Downloads82
Episodes5000

Why This Matters for Physical AI

This dataset enables training of cross-embodiment robotic policies by providing augmented visual and proprioceptive observations paired with natural language instructions across multiple robot platforms, advancing generalization in language-conditioned robot learning.

Technical Profile

Modalities
rgbproprioceptionlanguage
Robot Embodiments
Google RobotJacoKinova3KUKA IIWAFranka PandaSawyerUR5e
Environment
lab
Task Types
manipulationlanguage-instructed manipulation
Episodes
5000
Data Format
parquet
Annotation Types
language_instructions
License
cc-by-4.0
Part of the OXE-AugE family

Access

Need custom rgb data?

Claru builds purpose-built datasets for lab applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets