CrossTimeBench
A comprehensive benchmark for evaluating Multimodal Large Language Models (MLLMs) on cross-temporal video reasoning in complex multi-event scenarios. It consolidates 12 existing VQA datasets into 2,607 videos and 8,496 QA pairs that require integrating information from multiple temporal segments.
Downloads: 33
Technical Profile
- Modalities: rgb, language
- Task Types: video-classification, question-answering, temporal-reasoning, action-anticipation, temporal-localization
- Data Format: json
- License: cc-by-4.0