Abdrahunknown

SmolRGPT Open Spatial Dataset 110k Subset

A multimodal dataset with RGB and depth images paired with textual annotations for spatial reasoning tasks in warehouse environments, designed to train compact vision-language models with efficient spatial understanding capabilities.

Downloads70

Technical Profile

Modalities
rgbdepthlanguage
Environment
warehouse
Task Types
spatial_reasoningvision_language
Data Format
HDF5
License
unknown
Part of the SmolRGPT Open Spatial Dataset 110k Subset family

Access

Need custom rgb data?

Claru builds purpose-built datasets for warehouse applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets