hamzasheedi2025

MimicDroid: In-Context Learning for Humanoid Robot Manipulation from Human Play Videos

A dataset of 8 hours of human play videos and simulated humanoid robot manipulation tasks across 30 objects and 8 kitchen environments, designed to enable in-context learning for humanoid robots to solve new manipulation tasks from few video examples.

Downloads110
Hours8

Why This Matters for Physical AI

MimicDroid demonstrates how large-scale human play videos can serve as scalable training data for in-context learning in humanoid robots, advancing the capability of embodied AI systems to generalize to novel manipulation tasks with minimal task-specific supervision.

Technical Profile

Modalities
rgb
Robot Embodiments
humanoid
Environment
simulationkitchen
Task Types
manipulationpick_and_placegrasping
Total Hours
8
Annotation Types
language_instructions
Part of the RoboCasa family

Access

Need custom rgb data?

Claru builds purpose-built datasets for simulation applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets