CrossTimeBench

A comprehensive benchmark for evaluating Multimodal Large Language Models on cross-temporal video reasoning in complex multi-event scenarios. It consolidates 12 existing VQA datasets, comprising 2,607 videos and 8,496 QA pairs that require integrating information from multiple temporal segments.

Downloads: 33

Technical Profile

Modalities
rgb, language
Task Types
video-classification, question-answering, temporal-reasoning, action-anticipation, temporal-localization
Data Format
json
License
cc-by-4.0
Part of the CrossTimeBench family
