Helios1208cc-by-4.0

BRIGHT+

An upgraded text retrieval benchmark designed to support reasoning-intensive retrieval across diverse domains including robotics, biology, economics, and more. It applies MARCUS, a multi-agent LLM-based pipeline, to clean and semantically chunk the original BRIGHT dataset, significantly improving signal-to-noise ratio and retrieval effectiveness.

Downloads70
Likes1

Technical Profile

Modalities
text
Task Types
text-retrieval
License
cc-by-4.0
Part of the BRIGHT+ family

Access

Need custom text data?

Claru builds purpose-built datasets for any environment applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets