VLA-Arena | 2025 | apache-2.0
VLA-Arena Dataset (L0 - Large Variant)
An open-source benchmark for evaluating Vision-Language-Action (VLA) models. This variant covers 60 tasks at difficulty level 0 (L0) and contains 3,000 human demonstrations across four domains: safety, distractor, extrapolation, and long-horizon. The dataset provides RGB images at 256x256 resolution, robot state vectors, 7-DoF continuous control signals, and natural language task instructions.
Downloads: 185
Episodes: 3000
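The per-timestep contents listed in the description above (a 256x256 RGB image, a robot state vector, a 7-DoF action, and a language instruction) can be sketched as a small record type with a consistency check. This is a minimal illustration, not the dataset's actual schema: the field names, the channels-last image layout, and the `validate_frame` helper are all assumptions, and the card does not state the length of the state vector.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

# Values taken from the card text. The channels-last layout is an assumption.
IMAGE_SHAPE: Tuple[int, int, int] = (256, 256, 3)  # height, width, RGB channels
ACTION_DIM = 7  # 7-DoF continuous control signal


@dataclass
class Frame:
    """One timestep of a demonstration (hypothetical field names)."""
    image_shape: Tuple[int, int, int]  # shape of the RGB image array
    state: Sequence[float]             # robot state vector (length not given in the card)
    action: Sequence[float]            # 7-DoF control signal
    instruction: str                   # natural language task instruction


def validate_frame(frame: Frame) -> List[str]:
    """Return a list of problems; an empty list means the frame looks consistent."""
    problems: List[str] = []
    if tuple(frame.image_shape) != IMAGE_SHAPE:
        problems.append(f"image shape {tuple(frame.image_shape)} != {IMAGE_SHAPE}")
    if len(frame.action) != ACTION_DIM:
        problems.append(f"action has {len(frame.action)} dims, expected {ACTION_DIM}")
    if len(frame.state) == 0:
        problems.append("empty state vector")
    if not frame.instruction.strip():
        problems.append("empty instruction")
    return problems
```

A check like this is cheap to run once over a downloaded copy before training, since shape mismatches are easier to diagnose at load time than inside a model's forward pass.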
Why This Matters for Physical AI
VLA-Arena is a benchmark for training and evaluating vision-language-action models. It evaluates them systematically on safety, generalization, and long-horizon task capabilities, all of which are essential for real-world robotic deployment.
Technical Profile
- Modalities: rgb, proprioception, language
- Action Space: end_effector_delta
- Environment: simulation
- Task Types: manipulation, grasping, pick_and_place, long_horizon
- Episodes: 3000
- Data Format: LeRobot
- Annotation Types: language_instructions, action_labels
- License: apache-2.0
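To make the `end_effector_delta` action space concrete, here is a minimal sketch of how a 7-DoF delta action might be split and applied. The component ordering (three translation deltas, three rotation deltas, one gripper command) is a common convention in manipulation datasets but is an assumption here, since the card does not specify it. The names `EEDelta`, `split_action`, and `apply_translation` are hypothetical.

```python
from typing import NamedTuple, Sequence, Tuple


class EEDelta(NamedTuple):
    """A 7-DoF end-effector delta, split into parts (assumed ordering)."""
    translation: Tuple[float, float, float]  # assumed: dx, dy, dz
    rotation: Tuple[float, float, float]     # assumed: droll, dpitch, dyaw
    gripper: float                           # assumed: gripper command


def split_action(action: Sequence[float]) -> EEDelta:
    """Split a flat 7-element action into translation, rotation, and gripper."""
    if len(action) != 7:
        raise ValueError(f"expected 7 action dims, got {len(action)}")
    a = [float(x) for x in action]
    return EEDelta((a[0], a[1], a[2]), (a[3], a[4], a[5]), a[6])


def apply_translation(
    position: Tuple[float, float, float], delta: EEDelta
) -> Tuple[float, float, float]:
    """Add the translation delta to the current end-effector position."""
    x, y, z = position
    dx, dy, dz = delta.translation
    return (x + dx, y + dy, z + dz)
```

A delta action is relative to the current pose, so the same action vector produces different absolute targets depending on where the end effector already is. Any scaling or normalization the environment applies before executing the delta is not covered by this sketch.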
Community Signals
Top 50% by downloads