SecCodeBench

coding

SecCodeBench evaluates LLM coding agents on secure code generation and vulnerability detection, testing the ability to produce code that is both functional and free from security vulnerabilities.

Methodology

Imported from llm-stats public benchmark metadata. Modality: text. Max score: 1. Categories: coding. Language: en. Verified by llm-stats: no.

Leaderboard

  1. Qwen3.5-397B-A17B self-reported llm-stats
    68.3%