A framework for cybersecurity evaluation includes a sandboxed target, varied inputs affecting task difficulty, available tools, and a grader to assess outcomes.
Patterns for Building Cybersecurity Evals
from English
A framework for cybersecurity evaluation includes a sandboxed target, varied inputs affecting task difficulty, available tools, and a grader to assess outcomes.