sanskxr02 | mit
Canonical Object-Physics Dataset (v3.0)
A dataset of 216 annotated images designed for training Vision-Language Models on physical property estimation tasks, mapping visual instances to ground-truth mass and friction values.
Downloads: 226
Episodes: 216
Why This Matters for Physical AI
This dataset is intended to help vision-language models learn to estimate physical properties, specifically mass and friction, from visual input. Embodied AI systems need this capability to reason about object physics in manipulation and interaction tasks.
Technical Profile
- Modalities: rgb, language
- Environment: kitchen
- Task Types: physics-reasoning, object-detection
- Episodes: 216
- Data Format: imagefolder
- Annotation Types: bounding_boxes, language_instructions, reward_labels
- License: mit
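The imagefolder data format listed above conventionally means image files stored alongside a metadata.jsonl (or metadata.csv) file whose file_name column points at each image. As a rough sketch under that assumption, the Python below reads such a metadata file using only the standard library. The `mass` and `friction` column names, the `PhysicsRecord` type, and the `iter_records` helper are hypothetical and chosen for illustration; the dataset's actual column names and units are not stated on this page and may differ.

```python
import json
from dataclasses import dataclass
from pathlib import Path
from typing import Iterator


@dataclass
class PhysicsRecord:
    image_path: Path
    mass: float      # assumed column name; unit not stated on this page
    friction: float  # assumed column name; presumably a friction coefficient


def iter_records(split_dir: Path) -> Iterator[PhysicsRecord]:
    """Yield one record per non-empty line of metadata.jsonl in split_dir."""
    with open(split_dir / "metadata.jsonl", encoding="utf-8") as fh:
        for line in fh:
            if not line.strip():
                continue  # tolerate blank lines between JSON rows
            row = json.loads(line)
            yield PhysicsRecord(
                image_path=split_dir / row["file_name"],
                mass=float(row["mass"]),
                friction=float(row["friction"]),
            )
```

Calling `list(iter_records(Path("train")))` on a local copy would return one record per annotated image, provided the column names match; check the real metadata header first and adjust the keys accordingly.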
Community Signals
Top 50% by downloads
HuggingFace Discussions: 1