yatin-superintelligenceother
Adversarial Agent Intent Safety Analysis 240K
A dataset of 242,454 adversarial prompts with safety evaluations designed to train AI models to identify dual-use threats and malicious intent hidden within legitimate-sounding requests. Features deep intent analysis across 126 risk vectors to decouple surface interpretation from true capability impact.
Downloads131
Likes9
Technical Profile
- Modalities
- language
- Task Types
- red-teamingsafety-alignmentintent-classification
- Data Format
- parquet
- License
- other
Access
Need custom language data?
Claru builds purpose-built datasets for any environment applications with dense human annotations and quality assurance.
Request a Sample Pack