sanskxr02mit

Canonical Object-Physics Dataset (v3.0)

A dataset of 216 annotated images designed for training Vision-Language Models on physical property estimation tasks, mapping visual instances to ground-truth mass and friction values.

Downloads226
Episodes216

Why This Matters for Physical AI

This dataset enables vision-language models to learn robust physical property estimation from visual input, a critical capability for embodied AI systems that must reason about object physics for manipulation and interaction tasks.

Technical Profile

Modalities
rgblanguage
Environment
kitchen
Task Types
physics-reasoningobject-detection
Episodes
216
Data Format
imagefolder
Annotation Types
bounding_boxeslanguage_instructionsreward_labels
License
mit
Part of the Canonical Object-Physics Dataset (v3.0) family

Community Signals

Top 50% by downloads
HuggingFace Discussions1

Access

Need custom rgb data?

Claru builds purpose-built datasets for kitchen applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets