VincentNi · apache-2.0
RoboTwin stack_blocks_three — Wan2.2 TI2V Rollouts
160 text+image-to-video rollouts for the stack_blocks_three task from RoboTwin, generated with the Wan2.2 TI2V (5B) diffusion model and scored with the blocks_stack_v2 reward model. Companion dataset to the EmbodiedVideoRL / DanceGRPO reward-model work.
Downloads: 4
Episodes: 160
Why This Matters for Physical AI
This dataset enables training and evaluation of reward models for video-based robotic control by providing ground-truth task success labels and detailed per-frame tracking annotations for a complex manipulation task.
Technical Profile
- Modalities: rgb, language
- Robot Embodiments: dual_arm_manipulator
- Action Space: joint_positions
- Environment: simulation
- Task Types: pick_and_place, stacking, manipulation
- Episodes: 160
- Data Format: mp4
- Annotation Types: language_instructions, reward_labels, action_labels, segmentation, bounding_boxes
- License: apache-2.0
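As a rough illustration of how the mp4 rollouts and their reward labels might be consumed together, the sketch below indexes a local copy of the dataset and ranks rollouts by score. The on-disk layout is not documented here, so everything about it is an assumption: the `scores.json` file, its filename-to-score mapping, and the helper names `index_rollouts` and `top_k` are all hypothetical.

```python
import json
from pathlib import Path
from typing import NamedTuple, Optional


class Rollout(NamedTuple):
    video: Path               # path to one mp4 rollout
    reward: Optional[float]   # reward-model score, or None if not listed


def index_rollouts(root, scores_file="scores.json"):
    """Pair every .mp4 under `root` with a score, if one is available.

    ASSUMPTION: scores live in a JSON file mapping video filename -> float.
    The real dataset may store reward labels differently.
    """
    root = Path(root)
    scores_path = root / scores_file
    scores = json.loads(scores_path.read_text()) if scores_path.exists() else {}
    # Sort for a deterministic order across runs.
    return [Rollout(p, scores.get(p.name)) for p in sorted(root.rglob("*.mp4"))]


def top_k(rollouts, k):
    """Return the k highest-scoring rollouts, skipping unscored ones."""
    scored = [r for r in rollouts if r.reward is not None]
    return sorted(scored, key=lambda r: r.reward, reverse=True)[:k]
```

With a local copy in place, `top_k(index_rollouts("path/to/dataset"), 10)` would give the ten best-scored rollouts, which is one simple way to pick positive examples when training or checking a reward model.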