MASK

reasoning

MASK is a collection of 1,000 questions that measures whether models faithfully report their beliefs when pressured to lie. It operationalizes deception as the rate at which a model lies, that is, knowingly makes false statements intended to be received as true. A lower dishonesty rate indicates greater honesty.
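As a rough illustration of how a rate of this kind could be computed, the sketch below assumes each response has already been given one label. The label names (`lie`, `honest`, `evasive`) and the function `dishonesty_rate` are hypothetical and are not taken from the benchmark itself, whose actual scoring procedure may differ.

```python
from collections import Counter


def dishonesty_rate(labels):
    """Return the fraction of responses labeled "lie".

    `labels` is a list of per-question labels. The label vocabulary
    ("lie", "honest", "evasive") is an assumption made for illustration,
    not the benchmark's official scheme.
    """
    if not labels:
        raise ValueError("labels must be non-empty")
    counts = Counter(labels)
    # Counter returns 0 for missing keys, so a run with no lies gives 0.0.
    return counts["lie"] / len(labels)


# Hypothetical labels for four questions.
labels = ["lie", "honest", "lie", "evasive"]
print(f"{dishonesty_rate(labels):.1%}")  # prints 50.0%
```

In this sketch, evasive answers count toward the denominator but not as lies. That is one possible design choice among several, and it directly affects the reported rate.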

Leaderboard

Grok-4.1 Thinking: 51.0%