VLA-Arena | 2025 | apache-2.0
VLA-Arena Dataset (L1 - Small Variant)
An open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models, featuring 55 tasks at difficulty level L1 with 550 human demonstrations across safety, distractor, extrapolation, and long horizon domains.
Downloads: 556
Episodes: 550
Why This Matters for Physical AI
VLA-Arena provides a systematic benchmark for evaluating vision-language-action models across safety, generalization, and long-horizon reasoning capabilities essential for real-world robotic deployment.
Technical Profile
- Modalities: rgb, proprioception, language
- Action Space: end_effector_delta
- Environment: simulation
- Task Types: manipulation, grasping, object_manipulation, multi-step_planning
- Episodes: 550
- Data Format: LeRobot
- Annotation Types: language_instructions, action_labels
- License: apache-2.0
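To make the profile above more concrete, here is a minimal Python sketch of what one timestep with these modalities might look like, and of how an end-effector delta action space accumulates into an absolute trajectory. The field names, the `Step` class, the `integrate_deltas` helper, and the 4-value action layout `[dx, dy, dz, gripper]` are all assumptions made for illustration; the dataset's actual keys and action dimensionality are not stated here and should be checked against the dataset itself.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class Step:
    """One timestep of a demonstration.

    Hypothetical layout: these field names are illustrative and are not
    the dataset's actual keys.
    """
    rgb: list              # camera image, e.g. nested H x W x 3 pixel values
    proprio: List[float]   # robot proprioceptive state
    instruction: str       # natural-language task instruction
    action: List[float]    # assumed layout: [dx, dy, dz, gripper]


def integrate_deltas(
    start_xyz: Tuple[float, float, float],
    steps: Sequence[Step],
) -> List[Tuple[float, float, float]]:
    """Accumulate per-step translation deltas into absolute positions.

    With a delta action space, each action is an offset from the current
    end-effector pose rather than an absolute target, so the trajectory
    is recovered by summing the offsets from a known starting position.
    """
    x, y, z = start_xyz
    path = [(x, y, z)]
    for step in steps:
        dx, dy, dz = step.action[:3]  # ignore the assumed gripper channel
        x, y, z = x + dx, y + dy, z + dz
        path.append((x, y, z))
    return path


# Usage: two steps of a toy episode starting at the origin.
demo = [
    Step(rgb=[], proprio=[0.0], instruction="pick up the cup",
         action=[0.5, 0.0, 0.25, 1.0]),
    Step(rgb=[], proprio=[0.0], instruction="pick up the cup",
         action=[0.5, -0.25, 0.0, 1.0]),
]
trajectory = integrate_deltas((0.0, 0.0, 0.0), demo)
```

Only the translation component is integrated here; a real delta action would typically also carry rotation, which needs proper composition rather than simple addition.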
Community Signals
Top 25% by downloads
HuggingFace Discussions: 1
Access