Abdrah2025

SmolRGPT Spatial Warehouse Dataset

A multimodal dataset integrating RGB and depth images for spatial reasoning tasks in warehouse environments, designed to train efficient vision-language models for resource-constrained robotics and industrial applications.

Downloads52
Episodes501025
Likes1

Why This Matters for Physical AI

This dataset enables training efficient vision-language models with spatial reasoning capabilities that can be deployed in resource-constrained warehouse and robotics environments without sacrificing performance.

Technical Profile

Modalities
rgbdepthlanguage
Environment
warehouse
Task Types
spatial-reasoning
Episodes
501025
Data Format
HuggingFace
Annotation Types
language_instructionssegmentation
Part of the SmolRGPT family

Access

Need custom rgb data?

Claru builds purpose-built datasets for warehouse applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets