
LLM Leaderboard
Compare how different large language models perform at writing Clerk code and select the one that best fits your requirements.
Avg. MCP Improvement:+8%
| Model / Average | Organizations | Billing | Webhooks | Auth | Quickstarts | UI Components | User Management |
|---|---|---|---|---|---|---|---|
1 v0-1.5-md 71% | 91% | 62% | 75% | 60% | 76% | 86% | 50% |
2 Claude Opus 4.6 68% | 68% | 63% | 91% | 49% | 93% | 68% | 42% |
3 GPT-5.2 67% | 77% | 67% | 87% | 54% | 91% | 66% | 25% |
4 Claude Opus 4.5 66% | 75% | 60% | 87% | 54% | 93% | 71% | 25% |
5 GPT-5 66% | 67% | 68% | 87% | 37% | 91% | 62% | 50% |
6 Gemini 3 Pro Preview 64% | 54% | 62% | 68% | 54% | 91% | 78% | 42% |
7 Claude Sonnet 4.5 64% | 48% | 63% | 74% | 43% | 83% | 86% | 50% |
8 Claude Sonnet 4 63% | 53% | 60% | 72% | 49% | 91% | 86% | 33% |
9 GPT-5 Chat 63% | 69% | 66% | 81% | 38% | 89% | 66% | 33% |
10 v0-1.5-lg 62% | 49% | 60% | 59% | 59% | 67% | 74% | 67% |
11 Claude Opus 4 60% | 40% | 60% | 78% | 38% | 91% | 81% | 33% |
12 GPT-5.2 Codex 59% | 74% | 63% | 78% | 38% | 54% | 64% | 42% |
13 Claude Haiku 4.5 58% | 62% | 66% | 68% | 44% | 71% | 71% | 25% |
14 GPT-4o 39% | 58% | 0% | 60% | 16% | 91% | 34% | 17% |
15 Gemini 2.5 Flash 24% | 0% | 0% | 0% | 0% | 91% | 79% | 0% |
Last updated: February 5, 2026
Trusted by fast-growing companies around the world.

