Blueprint-Bench 2

reasoning

Blueprint-Bench 2 is an agentic spatial reasoning benchmark that evaluates a model's ability to understand, plan, and reason over architectural blueprints and other structured spatial documents. Scores are reported as a normalized score.

Leaderboard

Showing 2 of 2 results

Claude Fable 5

38.6%

i
Gemini 3.5 Flash

33.6%

i