hamzasheedi2025
MimicDroid: In-Context Learning for Humanoid Robot Manipulation from Human Play Videos
A dataset of 8 hours of human play videos and simulated humanoid robot manipulation tasks across 30 objects and 8 kitchen environments, designed to enable in-context learning for humanoid robots to solve new manipulation tasks from few video examples.
Downloads110
Hours8
Why This Matters for Physical AI
MimicDroid demonstrates how large-scale human play videos can serve as scalable training data for in-context learning in humanoid robots, advancing the capability of embodied AI systems to generalize to novel manipulation tasks with minimal task-specific supervision.
Technical Profile
- Modalities
- rgb
- Robot Embodiments
- humanoid
- Environment
- simulationkitchen
- Task Types
- manipulationpick_and_placegrasping
- Total Hours
- 8
- Annotation Types
- language_instructions
Access
Need custom rgb data?
Claru builds purpose-built datasets for simulation applications with dense human annotations and quality assurance.
Request a Sample Pack