Abdrah2025unknown
SmolRGPT Open Spatial Dataset 110k Subset
A multimodal dataset with RGB and depth images paired with textual annotations for spatial reasoning tasks in warehouse environments, designed to train compact vision-language models with efficient spatial understanding capabilities.
Downloads100
Episodes1100010
Why This Matters for Physical AI
This dataset enables training of efficient vision-language models with spatial reasoning capabilities suitable for resource-constrained robotic and warehouse environments where both computational efficiency and robust spatial understanding are critical.
Technical Profile
- Modalities
- rgbdepthlanguage
- Environment
- warehouse
- Task Types
- spatial_reasoningvision_language
- Episodes
- 1100010
- Data Format
- HDF5
- Annotation Types
- language_instructionssegmentation
- License
- unknown
Access
Need custom rgb data?
Claru builds purpose-built datasets for warehouse applications with dense human annotations and quality assurance.
Request a Sample Pack