VLA-Arenaapache-2.0

VLA-Arena Dataset (L1 - Small Variant)

Name: VLA-Arena Dataset (L1 - Small Variant)
Creator: VLA-Arena
License: apache-2.0
Keywords: rgb, proprioception, language, manipulation, grasping, object_manipulation, multi-step_planning, simulation

An open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models, featuring 55 tasks at difficulty level L1 with 550 human demonstrations across safety, distractor, extrapolation, and long horizon domains.

Downloads182

Technical Profile

Modalities: rgbproprioceptionlanguage
Environment: simulation
Task Types: manipulationgraspingobject_manipulationmulti-step_planning
Data Format: LeRobot
License: apache-2.0

Part of the VLA-Arena Dataset (L1 - Small Variant) family

Access

View on HuggingFace

Need custom rgb data?

Claru builds purpose-built datasets for simulation applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets

VLA-Arena Dataset (L1 - Small Variant)

An open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models, featuring 55 tasks at difficulty level L1 with 550 human demonstrations formatted for training. Designed to assess robotic agents across safety, distraction handling, extrapolation, and long-horizon task capabilities.

rgbproprioceptionlanguage

554 downloadsDec 2025apache-2.0

VLA-Arena Dataset (L1 - Small Variant)

An open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models featuring 55 tasks across safety, distractor, extrapolation, and long-horizon domains at difficulty level L1. Contains 550 human demonstrations with high-resolution RGB images, robot state vectors, and natural language task instructions.

rgbproprioceptionlanguage

313 downloadsDec 2025apache-2.0

ABC-130k

The largest open-source robot teleoperation dataset containing bimanual manipulation trajectories collected on two-arm YAM stations with 130,822 episodes across 3,555 hours of data.

rgbproprioception

663K downloadsJul 2026apache-2.0

Hy-Embodied-0.5-VLA-Data

A large-scale bimanual manipulation dataset with 2,163 hours of high-fidelity demonstrations collected via custom fingertip UMI device with optical motion-capture, spanning 70+ manipulation tasks for training Vision-Language-Action foundation models.

rgbproprioceptionlanguage

235K downloadsJul 2026cc-by-4.0

T-Rex Dataset

A large-scale, tactile-reactive bimanual manipulation dataset collected via teleoperation on a Dexmate Vega-1 robot with two Sharpa Wave dexterous hands, featuring 5,464 episodes with tactile, RGB, and proprioceptive observations.

rgbtactileforce_torqueproprioception

170K downloadsJun 2026MIT

OmniAction

A large-scale multimodal dataset for proactive robot manipulation comprising 141,162 episodes with cross-modal contextual instructions derived from spoken dialogue, environmental sounds, and visual cues rather than explicit commands.

rgbaudiolanguage

95K downloadsMar 2026cc-by-nc-4.0