VLA-Arena2025apache-2.0
VLA-Arena Dataset (L0 - Medium Variant)
Level 0 (L0) - Medium (M) variant of the VLA-Arena benchmark containing 1,800 human demonstrations across 60 distinct tasks for training Vision-Language-Action models. The dataset is formatted according to Lerobot specifications and optimized for training the SmolVLA model.
Downloads80
Episodes1800
Why This Matters for Physical AI
VLA-Arena provides a systematic benchmark for evaluating Vision-Language-Action models across safety, generalization, and long-horizon reasoning tasks, which are critical for deploying embodied AI agents in real-world robotic applications.
Technical Profile
- Modalities
- rgbproprioceptionlanguage
- Action Space
- end_effector_delta
- Environment
- simulation
- Task Types
- manipulationgraspingpick_and_place
- Episodes
- 1800
- Data Format
- LeRobot
- Annotation Types
- language_instructionsaction_labels
- License
- apache-2.0
Access
Need custom rgb data?
Claru builds purpose-built datasets for simulation applications with dense human annotations and quality assurance.
Request a Sample Pack