SpaceVistacc-by-4.0
SpaceVista: All-Scale Visual Spatial Reasoning from mm to km
A large-scale dataset of 1M-10M video-text pairs spanning spatial scales from millimeters (tabletop) to kilometers (drone), with annotations for visual spatial reasoning tasks including point, bounding box, mask, and cross-frame object matching.
Downloads614
Likes1
Technical Profile
- Modalities
- rgblanguage
- Environment
- labhomeindooroutdoor
- Task Types
- spatial_reasoningvideo_understandingvisual_question_answering
- Data Format
- JSON with frame paths
- License
- cc-by-4.0
Community Signals
Top 50% by downloads
Access
Need custom rgb data?
Claru builds purpose-built datasets for lab applications with dense human annotations and quality assurance.
Request a Sample Pack