
The Unleaderboard: Google, OpenAI, and Anthropic Self-Reported Hallucination and Accuracy Scores
This isn't a third party leaderboard. It's a snapshot of what OpenAI, Google, and Anthropic self report about their models. These factuality and hallucination metrics are often buried in system cards, so we surfaced them to shed light on a persistent issue: even the most advanced models still struggle with the truth.