TF TRUSTFALL 120 runs · 30 scenarios · 4 models GitHub ↗

INTERACTIVE RESULTS · POC v0.1

Run-by-run results.

Every tool invocation, event, and cascade — from all four models, across all 30 scenarios. Click any row to inspect the trace. Failures are sorted to the top.

Switch model:

Aggregate — per threat class

Heatmap — ASR / BR / TPR by threat class

TPR by scenario — scope-graph reachability, ÷ declared

Scenarios — click a row to load the event trace; failures sorted first

ID	Class	Attack	BR	RI	DL	SWH	TPR	Turns	Terminated