VLA-Arenaapache-2.0

VLA-Arena Dataset (L1 - Small Variant)

Name: VLA-Arena Dataset (L1 - Small Variant)
Creator: VLA-Arena
License: apache-2.0
Keywords: rgb, proprioception, language, manipulation, grasping, pick_and_place, simulation

An open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models featuring 55 tasks across safety, distractor, extrapolation, and long-horizon domains at difficulty level L1. Contains 550 human demonstrations with high-resolution RGB images, robot state vectors, and natural language task instructions.

Downloads316

Technical Profile

Modalities: rgbproprioceptionlanguage
Environment: simulation
Task Types: manipulationgraspingpick_and_place
Data Format: hdf5
License: apache-2.0

Part of the VLA-Arena Dataset (L1 - Small Variant) family

Access

View on HuggingFace

Need custom rgb data?

Claru builds purpose-built datasets for simulation applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets

VLA-Arena Dataset (L1 - Small Variant)

An open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models, featuring 55 tasks at difficulty level L1 with 550 human demonstrations formatted for training. Designed to assess robotic agents across safety, distraction handling, extrapolation, and long-horizon task capabilities.

rgbproprioceptionlanguage

554 downloadsDec 2025apache-2.0

VLA-Arena Dataset (L1 - Small Variant)

An open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models, featuring 55 tasks at difficulty level L1 with 550 human demonstrations across safety, distractor, extrapolation, and long horizon domains.

rgbproprioceptionlanguage

183 downloadsFeb 2026apache-2.0

Jetson1 060826 Subtask Grab2 TsFile

An Apache TsFile conversion of a LeRobot v3.0 robot dataset containing time-series observations and actions from a vibeboard_follower_tilt robot performing grasping tasks. The dataset includes 55 episodes with 7,683 frames of numeric robot state and action data at 20 fps.

timeseriesproprioception

null downloadsJul 2026apache-2.0

CSI-Agent Towel Folding Dataset

A dataset of towel folding demonstrations collected using LeRobot with a bimanual SO-101 robot. Contains 50 episodes of towel folding tasks with multi-camera RGB observations and joint position actions.

rgbproprioception

null downloadsJul 2026apache-2.0

WGO-Bench Localization Given Labels

A temporal localization benchmark where models are given robot manipulation videos and shuffled event labels with multiplicities, and must predict the start and end times for each labeled event occurrence. Derived from Macrodata Labs' WGO-Bench with 100 episodes split into 80 training and 20 test episodes.

rgblanguage

null downloadsJul 2026cc-by-nc-sa-4.0

PerSim: Personality-Conditioned Household Object-Movement Trajectories

Synthetic multi-day object-movement trajectories from 16 personas spanning Big-Five trait space across 5 interactive home scenes, containing 27,976 object-movement actions with time-of-day, trigger, and natural-language context.

language

null downloadsJul 2026cc-by-4.0