ExploitBench

coding

ExploitBench is a cybersecurity benchmark that evaluates a model's ability to discover and exploit software vulnerabilities, reported as the fraction of challenges where the model captures the target (Cap%).

Leaderboard

Showing 1 of 1 result

Claude Fable 5

78.0%

i