SpaceVistacc-by-4.0

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

A large-scale dataset of 1M-10M video-text pairs spanning spatial scales from millimeters (tabletop) to kilometers (drone), with annotations for visual spatial reasoning tasks including point, bounding box, mask, and cross-frame object matching.

Downloads614
Likes1

Technical Profile

Modalities
rgblanguage
Environment
labhomeindooroutdoor
Task Types
spatial_reasoningvideo_understandingvisual_question_answering
Data Format
JSON with frame paths
License
cc-by-4.0
Part of the SpaceVista: All-Scale Visual Spatial Reasoning from mm to km family

Community Signals

Top 50% by downloads

Access

Need custom rgb data?

Claru builds purpose-built datasets for lab applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets