Abdrah2025unknown

SmolRGPT Open Spatial Dataset 110k Subset

A multimodal dataset with RGB and depth images paired with textual annotations for spatial reasoning tasks in warehouse environments, designed to train compact vision-language models with efficient spatial understanding capabilities.

Downloads100
Episodes1100010

Why This Matters for Physical AI

This dataset enables training of efficient vision-language models with spatial reasoning capabilities suitable for resource-constrained robotic and warehouse environments where both computational efficiency and robust spatial understanding are critical.

Technical Profile

Modalities
rgbdepthlanguage
Environment
warehouse
Task Types
spatial_reasoningvision_language
Episodes
1100010
Data Format
HDF5
Annotation Types
language_instructionssegmentation
License
unknown
Part of the Open Spatial Dataset family

Access

Need custom rgb data?

Claru builds purpose-built datasets for warehouse applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets