VincentNi · apache-2.0

RoboTwin stack_blocks_three — Wan2.2 TI2V Rollouts

160 text+image-to-video rollouts for the stack_blocks_three task from RoboTwin, generated with the Wan2.2 TI2V (5B) diffusion model and scored with the blocks_stack_v2 reward model. Companion dataset to the EmbodiedVideoRL / DanceGRPO reward-model work.

Downloads: 4
Episodes: 160

Why This Matters for Physical AI

This dataset supports training and evaluating reward models for video-based robotic control. It provides ground-truth task success labels and detailed per-frame tracking annotations for stack_blocks_three, a dual-arm block-stacking manipulation task.

Technical Profile

Modalities
rgb, language
Robot Embodiments
dual_arm_manipulator
Action Space
joint_positions
Environment
simulation
Task Types
pick_and_place, stacking, manipulation
Episodes
160
Data Format
mp4
Annotation Types
language_instructions, reward_labels, action_labels, segmentation, bounding_boxes
License
apache-2.0
Part of the RoboTwin family
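
As a quick sanity check on a downloaded copy, the profile above (mp4 data format, 160 episodes) can be compared against what is actually on disk. The following is a minimal sketch, not the dataset's official loader: it assumes one mp4 file per episode and an arbitrary local directory, neither of which this listing confirms. The function names and the path are hypothetical.

```python
from pathlib import Path

# Episode count stated in the technical profile.
EXPECTED_EPISODES = 160


def list_rollout_videos(root):
    """Return every .mp4 file under root (searched recursively), sorted."""
    return sorted(p for p in Path(root).rglob("*.mp4") if p.is_file())


def check_episode_count(root, expected=EXPECTED_EPISODES):
    """Return (matches, found) for the videos under root.

    Assumption: one mp4 per episode. The listing does not state the
    on-disk layout, so treat a mismatch as a prompt to inspect the files.
    """
    found = len(list_rollout_videos(root))
    return found == expected, found
```

For example, `ok, found = check_episode_count("path/to/local/copy")` reports whether the number of videos found equals 160, where the path is a placeholder for wherever the files were saved.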
