Tau3 Telecom

reasoning

The τ³-Bench telecom domain evaluates agentic models on multi-turn, tool-using customer-support and troubleshooting scenarios in a simulated telecommunications environment.

Leaderboard

Mistral Medium 3.5: 91.4%
