yatin-superintelligence | other

Adversarial Agent Intent Safety Analysis 240K

A dataset of 242,454 adversarial prompts with safety evaluations, designed to train AI models to identify dual-use threats and malicious intent hidden within legitimate-sounding requests. It features deep intent analysis across 126 risk vectors to separate a request's surface interpretation from its true capability impact.

Downloads: 131
Likes: 9

Technical Profile

Modalities: language
Task Types: red-teaming, safety-alignment, intent-classification
Data Format: parquet
License: other
Part of the Adversarial Agent Intent Safety Analysis 240K family
