Benchmark · safety

HarmBench

4 results 2 models
0 3.5 7 10.5 14 2026-06-17 Opus 4.8 · 11.5 · 2026-06-17 Opus 4.8 · 11.5 · 2026-06-17 Fable 5 · 6.1 · 2026-06-17 Fable 5 · 6.1 · 2026-06-17
Opus 4.8 Fable 5
Timeline
  1. 2026-06-17 Opus 4.8 11.5% Red-Team Study Finds Frontier LLMs Remain Vulnerable to Automated Attacks
  2. 2026-06-17 Fable 5 6.1% Red-Team Study Finds Frontier LLMs Remain Vulnerable to Automated Attacks
  3. 2026-06-17 Opus 4.8 11.5% Red-Team Study Finds Frontier LLMs Remain Vulnerable to Adaptive Attacks
  4. 2026-06-17 Fable 5 6.1% Red-Team Study Finds Frontier LLMs Remain Vulnerable to Adaptive Attacks