oxe-auge2025cc-by-4.0

language_table_train_410000_415000_augmented

An augmented subset of the Language Table dataset containing 5,000 episodes with cross-embodiment image augmentation across 8 different robot platforms, enabling training of generalist manipulation policies with natural language instructions.

Downloads63
Episodes5000

Why This Matters for Physical AI

This dataset enables training of cross-embodiment manipulation policies by providing augmented visual observations across multiple robot platforms paired with natural language instructions, advancing generalist robot learning at scale.

Technical Profile

Modalities
rgbproprioceptionlanguage
Robot Embodiments
google_robotjacokinova3kuka_iiwapandasawyerur5e
Action Space
joint_positions
Environment
lab
Task Types
manipulation
Episodes
5000
Data Format
parquet
Annotation Types
language_instructions
License
cc-by-4.0
Part of the OXE-AugE family

Access

Need custom rgb data?

Claru builds purpose-built datasets for lab applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets