FActScore

reasoning

A fine-grained atomic evaluation metric for factual precision in long-form text generation that breaks generated text into atomic facts and computes the percentage supported by reliable knowledge sources, with automated assessment using retrieval and language models

Leaderboard

Showing 2 of 2 results

Grok-4.1

97.0%

i
GPT-5

1.0%

i