siyichapache-2.0

SpaceTools Evaluation Benchmarks

Evaluation benchmarks for spatial reasoning with vision tool calling across 9 different tasks including robot spatial reasoning, object localization, pose estimation, and depth comparison.

Downloads55
Episodes2121

Why This Matters for Physical AI

This benchmark evaluates spatial reasoning capabilities critical for robotic manipulation and object interaction, testing vision models on robot-relevant tasks like pose estimation and grasp planning.

Technical Profile

Modalities
rgbdepth
Task Types
spatial_reasoningobject_localizationobject_placementpose_estimationgrasp_estimationdepth_orderingspatial_relation
Episodes
2121
Data Format
parquet
Annotation Types
language_instructionsreward_labelsbounding_boxes
License
apache-2.0
Part of the SpaceTools Evaluation Benchmarks family

Access

Need custom rgb data?

Claru builds purpose-built datasets for any environment applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets