nvidianvidia-noncommercial-license
R4D-Bench
A region-level 4D video question answering benchmark with 1,419 region-prompted multiple-choice VQA pairs built from dynamic real-world videos. It challenges models to track, reason about depth, and understand temporal dynamics of specific regions in video.
Downloads50
Likes4
Technical Profile
- Modalities
- rgblanguage
- Environment
- outdoorurban
- Task Types
- visual_question_answering3d_groundingspatial_reasoningmotion_understanding
- Data Format
- json
- License
- nvidia-noncommercial-license
Access
Need custom rgb data?
Claru builds purpose-built datasets for outdoor applications with dense human annotations and quality assurance.
Request a Sample Pack