BixBench

reasoning

BixBench is a benchmark for real-world bioinformatics and computational biology data analysis. It evaluates AI models on multi-step scientific workflows that require code execution, statistical reasoning, and biological domain knowledge to interpret experimental data.

Leaderboard

Showing 1 of 1 result

GPT-5.5

80.5%

i