VLA-Arena2025apache-2.0
VLA-Arena Dataset (L1 - Small Variant)
An open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models featuring 55 tasks across safety, distractor, extrapolation, and long-horizon domains at difficulty level L1. Contains 550 human demonstrations with high-resolution RGB images, robot state vectors, and natural language task instructions.
Downloads217
Episodes550
Why This Matters for Physical AI
VLA-Arena provides a systematic benchmark for evaluating vision-language-action models across critical dimensions of safety, generalization, and long-horizon reasoning essential for deploying embodied AI agents in real-world robotic applications.
Technical Profile
- Modalities
- rgbproprioceptionlanguage
- Action Space
- end_effector_delta
- Environment
- simulation
- Task Types
- manipulationgraspingpick_and_place
- Episodes
- 550
- Data Format
- hdf5
- Annotation Types
- language_instructionsaction_labels
- License
- apache-2.0
Community Signals
Top 50% by downloads
Access
Need custom rgb data?
Claru builds purpose-built datasets for simulation applications with dense human annotations and quality assurance.
Request a Sample Pack