siyichapache-2.0
SpaceTools Evaluation Benchmarks
Evaluation benchmarks for spatial reasoning with vision tool calling across 9 different tasks including robot spatial reasoning, object localization, pose estimation, and depth comparison.
Downloads55
Episodes2121
Why This Matters for Physical AI
This benchmark evaluates spatial reasoning capabilities critical for robotic manipulation and object interaction, testing vision models on robot-relevant tasks like pose estimation and grasp planning.
Technical Profile
- Modalities
- rgbdepth
- Task Types
- spatial_reasoningobject_localizationobject_placementpose_estimationgrasp_estimationdepth_orderingspatial_relation
- Episodes
- 2121
- Data Format
- parquet
- Annotation Types
- language_instructionsreward_labelsbounding_boxes
- License
- apache-2.0
Access
Need custom rgb data?
Claru builds purpose-built datasets for any environment applications with dense human annotations and quality assurance.
Request a Sample Pack