VLA-Arena | 2025 | apache-2.0

VLA-Arena Dataset (L0 - Medium Variant)

The Level 0 (L0), Medium (M) variant of the VLA-Arena benchmark contains 1,800 human demonstrations across 60 distinct tasks for training Vision-Language-Action (VLA) models. The dataset follows the LeRobot format and is optimized for training the SmolVLA model.
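As a quick sanity check on the figures above, the following sketch computes the average number of demonstrations per task and counts episodes per task from a list of task labels. The function names and the toy task labels are hypothetical, and the listing does not say whether demonstrations are spread evenly across tasks, so only the average is implied by the stated totals.

```python
from collections import Counter

# Figures taken from the dataset description above.
TOTAL_EPISODES = 1800
NUM_TASKS = 60


def average_demos_per_task(total_episodes, num_tasks):
    """Return the mean number of demonstrations per task."""
    if num_tasks <= 0:
        raise ValueError("num_tasks must be positive")
    return total_episodes / num_tasks


def demos_per_task(task_ids):
    """Count episodes per task from one task label per episode.

    `task_ids` is a hypothetical input; in practice it would come from
    the dataset's episode metadata.
    """
    return Counter(task_ids)


if __name__ == "__main__":
    print(average_demos_per_task(TOTAL_EPISODES, NUM_TASKS))
    print(demos_per_task(["open_drawer", "open_drawer", "pick_cup"]))
```

Comparing the per-task counts against the average is a simple way to spot an incomplete download or an unbalanced split before training.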

Downloads: 80
Episodes: 1,800

Why This Matters for Physical AI

VLA-Arena provides a systematic benchmark for evaluating Vision-Language-Action models across safety, generalization, and long-horizon reasoning tasks, which are critical for deploying embodied AI agents in real-world robotic applications.

Technical Profile

Modalities
rgb, proprioception, language
Action Space
end_effector_delta
Environment
simulation
Task Types
manipulation, grasping, pick_and_place
Episodes
1800
Data Format
LeRobot
Annotation Types
language_instructions, action_labels
License
apache-2.0
Part of the VLA-Arena family
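
To make the profile above concrete, here is a minimal sketch of what one timestep with these modalities could look like, and how an end_effector_delta action relates to absolute positions. The `Frame` fields, their shapes, and the position-only 3-D delta are assumptions for illustration; the dataset's real feature names and action dimensions (which may include rotation and gripper terms) are not given in this listing.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Frame:
    """One timestep, mirroring the modalities listed in the profile.

    All field names and shapes here are hypothetical.
    """
    rgb: List[List[Tuple[int, int, int]]]  # H x W grid of (r, g, b) pixels
    proprioception: List[float]            # assumed: end-effector x, y, z
    language_instruction: str              # natural-language task description
    action: List[float]                    # assumed: (dx, dy, dz) delta


def apply_delta(position, delta):
    """Add a delta action to the current end-effector position."""
    if len(position) != len(delta):
        raise ValueError("position and delta must have the same length")
    return [p + d for p, d in zip(position, delta)]


def rollout(start, deltas):
    """Integrate a sequence of delta actions into absolute positions."""
    positions = [list(start)]
    for delta in deltas:
        positions.append(apply_delta(positions[-1], delta))
    return positions


if __name__ == "__main__":
    frame = Frame(
        rgb=[[(0, 0, 0)]],
        proprioception=[0.0, 0.0, 0.0],
        language_instruction="pick up the cup",
        action=[1.0, 0.0, 0.0],
    )
    print(apply_delta(frame.proprioception, frame.action))
```

A delta action space stores the change in end-effector pose at each step rather than the target pose itself, so absolute trajectories are recovered by accumulating the deltas from a known starting state, as `rollout` does.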
