Helios1208cc-by-4.0
BRIGHT+
An upgraded text retrieval benchmark designed to support reasoning-intensive retrieval across diverse domains including robotics, biology, economics, and more. It applies MARCUS, a multi-agent LLM-based pipeline, to clean and semantically chunk the original BRIGHT dataset, significantly improving signal-to-noise ratio and retrieval effectiveness.
Downloads70
Likes1
Technical Profile
- Modalities
- text
- Task Types
- text-retrieval
- License
- cc-by-4.0
Access
Need custom text data?
Claru builds purpose-built datasets for any environment applications with dense human annotations and quality assurance.
Request a Sample Pack