VLA-Arenaapache-2.0

VLA-Arena Dataset (L0 - Large Variant)

Name: VLA-Arena Dataset (L0 - Large Variant)
Creator: VLA-Arena
License: apache-2.0
Keywords: rgb, proprioception, language, manipulation, grasping, pick_and_place, long_horizon, simulation

An open-source benchmark for evaluating Vision-Language-Action (VLA) models with 60 tasks at difficulty level 0, containing 3,000 human demonstrations across safety, distractor, extrapolation, and long-horizon domains. The dataset features high-resolution RGB images (256x256), robot state vectors, 7-DoF continuous control signals, and natural language task instructions.

Downloads356

Technical Profile

Modalities: rgbproprioceptionlanguage
Environment: simulation
Task Types: manipulationgraspingpick_and_placelong_horizon
Data Format: LeRobot
License: apache-2.0

Part of the VLA-Arena Dataset (L0 - Large Variant) family

Access

View on HuggingFace

Need custom rgb data?

Claru builds purpose-built datasets for simulation applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets

VLA-Arena Dataset (L0 - Large Variant)

An open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models, featuring 60 tasks at difficulty level 0 with 3,000 human demonstrations across safety, distractor, extrapolation, and long-horizon domains.

rgbproprioceptionlanguage

2K downloadsFeb 2026apache-2.0

VLA-Arena Dataset (L0 - Large Variant)

An open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models with 60 tasks at difficulty level 0, containing 3,000 human demonstrations (50 per task) across safety, distractor, extrapolation, and long horizon domains. The dataset is strictly formatted in RLDS with high-resolution RGB images (256x256), robot state vectors, 7-DoF continuous control signals, and natural language task instructions.

rgbproprioceptionlanguage

243 downloadsDec 2025apache-2.0

Jetson1 060826 Subtask Grab2 TsFile

An Apache TsFile conversion of a LeRobot v3.0 robot dataset containing time-series observations and actions from a vibeboard_follower_tilt robot performing grasping tasks. The dataset includes 55 episodes with 7,683 frames of numeric robot state and action data at 20 fps.

timeseriesproprioception

null downloadsJul 2026apache-2.0

CSI-Agent Towel Folding Dataset

A dataset of towel folding demonstrations collected using LeRobot with a bimanual SO-101 robot. Contains 50 episodes of towel folding tasks with multi-camera RGB observations and joint position actions.

rgbproprioception

null downloadsJul 2026apache-2.0

WGO-Bench Localization Given Labels

A temporal localization benchmark where models are given robot manipulation videos and shuffled event labels with multiplicities, and must predict the start and end times for each labeled event occurrence. Derived from Macrodata Labs' WGO-Bench with 100 episodes split into 80 training and 20 test episodes.

rgblanguage

null downloadsJul 2026cc-by-nc-sa-4.0

PerSim: Personality-Conditioned Household Object-Movement Trajectories

Synthetic multi-day object-movement trajectories from 16 personas spanning Big-Five trait space across 5 interactive home scenes, containing 27,976 object-movement actions with time-of-day, trigger, and natural-language context.

language

null downloadsJul 2026cc-by-4.0