Abdrah2025
SmolRGPT Spatial Warehouse Dataset
A multimodal dataset integrating RGB and depth images for spatial reasoning tasks in warehouse environments, designed to train efficient vision-language models for resource-constrained robotics and industrial applications.
Downloads52
Episodes501025
Likes1
Why This Matters for Physical AI
This dataset enables training efficient vision-language models with spatial reasoning capabilities that can be deployed in resource-constrained warehouse and robotics environments without sacrificing performance.
Technical Profile
- Modalities
- rgbdepthlanguage
- Environment
- warehouse
- Task Types
- spatial-reasoning
- Episodes
- 501025
- Data Format
- HuggingFace
- Annotation Types
- language_instructionssegmentation
Access
Need custom rgb data?
Claru builds purpose-built datasets for warehouse applications with dense human annotations and quality assurance.
Request a Sample Pack