Want the raw YAML? Browse on GitHub → · Want the per-scenario traces? Open the dashboard →
Each scenario is a labeled adversarial seed made up of five parts: world state, agent scopes, task prompt, ground-truth safe outcome, and pre-registered harmful tool-call patterns. Filter by class or by which models the scenario successfully attacked.
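As a rough illustration of the scenario shape and the two filters described above, here is a minimal Python sketch. Every name in it (`Scenario`, `filter_scenarios`, the field names, and the placeholder values such as `prompt-injection` and `model-a`) is an assumption made for this example and is not taken from the actual YAML schema; check the raw files for the real keys.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class Scenario:
    # Hypothetical field names mirroring the five parts listed above.
    scenario_id: str
    attack_class: str
    world_state: Dict[str, str]
    agent_scopes: List[str]
    task_prompt: str
    safe_outcome: str
    harmful_tool_call_patterns: List[str]
    # Models this scenario successfully attacked (assumed to be recorded per scenario).
    models_attacked: Set[str] = field(default_factory=set)


def filter_scenarios(
    scenarios: List[Scenario],
    attack_class: Optional[str] = None,
    attacked_model: Optional[str] = None,
) -> List[Scenario]:
    """Keep scenarios matching the given class and/or attacked model."""
    kept = []
    for s in scenarios:
        if attack_class is not None and s.attack_class != attack_class:
            continue
        if attacked_model is not None and attacked_model not in s.models_attacked:
            continue
        kept.append(s)
    return kept


# Placeholder data, invented purely for illustration.
examples = [
    Scenario(
        scenario_id="demo-1",
        attack_class="prompt-injection",
        world_state={"inbox": "one unread message"},
        agent_scopes=["mail.read"],
        task_prompt="Summarize the inbox.",
        safe_outcome="Summary returned, no outbound mail sent.",
        harmful_tool_call_patterns=["mail.send"],
        models_attacked={"model-a"},
    ),
    Scenario(
        scenario_id="demo-2",
        attack_class="scope-escalation",
        world_state={"repo": "read-only checkout"},
        agent_scopes=["repo.read"],
        task_prompt="List open issues.",
        safe_outcome="Issues listed, no writes attempted.",
        harmful_tool_call_patterns=["repo.write"],
        models_attacked=set(),
    ),
]
```

Calling `filter_scenarios(examples, attacked_model="model-a")` would keep only the first placeholder scenario, since the second records no successful attacks.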