Compare and track the latest AI model performance rankings
AI Leaderboards
Build-time snapshot of model evaluations and pricing from Artificial Analysis.
#1 GPT-5.2 (xhigh) · Intelligence 51.2
Source: artificialanalysis.ai
| Rank | Model | Creator | Intelligence | Coding | Math | Blended $/1M | Tok/s | TTFT (s) |
|---|---|---|---|---|---|---|---|---|
1最佳 | GPT-5.2 (xhigh) gpt-5-2 | OpenAI | 51.2 | 48.7 | 99 | $4.813 | 103.9 | 27 |
2 | Claude Opus 4.5 (Reasoning) claude-opus-4-5-thinking | Anthropic | 49.7 | 47.8 | 91.3 | $10 | 82.1 | 1.75 |
3 | GPT-5.2 Codex (xhigh) gpt-5-2-codex | OpenAI | 49 | 43 | — | $4.813 | 101.3 | 25.5 |
4 | Gemini 3 Pro Preview (high) gemini-3-pro | 48.4 | 46.5 | 95.7 | $4.5 | 126.2 | 31.63 | |
5 | GPT-5.1 (high) gpt-5-1 | OpenAI | 47.6 | 44.7 | 94 | $3.438 | 139.1 | 25.58 |
6新 | Kimi K2.5 (Reasoning) kimi-k2-5 | Kimi | 46.8 | 39.5 | — | $1.2 | 109.5 | 0.86 |
7 | GPT-5.2 (medium) gpt-5-2-medium | OpenAI | 46.6 | 44.2 | 96.7 | $4.813 | 0 | 0 |
8 | Gemini 3 Flash Preview (Reasoning) gemini-3-flash-reasoning | 46.4 | 42.6 | 97 | $1.125 | 203.3 | 12.44 | |
9 | GPT-5 (high) gpt-5 | OpenAI | 44.6 | 36 | 94.3 | $3.438 | 129.3 | 96.78 |
10 | GPT-5 Codex (high) gpt-5-codex | OpenAI | 44.5 | 38.9 | 98.7 | $3.438 | 344.1 | 11.18 |
11 | Claude Opus 4.5 (Non-reasoning) claude-opus-4-5 | Anthropic | 43 | 42.9 | 62.7 | $10 | 79 | 1.95 |
12 | Claude 4.5 Sonnet (Reasoning) claude-4-5-sonnet-thinking | Anthropic | 42.9 | 38.6 | 88 | $6 | 82.6 | 1.32 |
13 | GPT-5.1 Codex (high) gpt-5-1-codex | OpenAI | 42.2 | 36.6 | 95.7 | $3.438 | 254.6 | 9.94 |
14 | GLM-4.7 (Reasoning) glm-4-7 | Z AI | 42 | 36.3 | 95 | $0.875 | 136.5 | 0.84 |
15 | GPT-5 (medium) gpt-5-medium | OpenAI | 41.8 | 39 | 91.7 | $3.438 | 137.1 | 42.94 |
16 | DeepSeek V3.2 (Reasoning) deepseek-v3-2-reasoning | DeepSeek | 41.6 | 36.7 | 92 | $0.315 | 32.3 | 1.1 |
17 | Grok 4 grok-4 | xAI | 41.4 | 40.5 | 92.7 | $6 | 33.3 | 8.24 |
18 | Gemini 3 Pro Preview (low) gemini-3-pro-low | 41.1 | 39.4 | 86.7 | $4.5 | 125.3 | 4.13 | |
19 | GPT-5 mini (high) gpt-5-mini | OpenAI | 41 | 35.3 | 90.7 | $0.688 | 79.5 | 107.34 |
20 | o3 o3 | OpenAI | 40.9 | 38.4 | 88.3 | $3.5 | 230.8 | 11.67 |
21 | Kimi K2 Thinking kimi-k2-thinking | Kimi | 40.7 | 34.8 | 94.7 | $1.075 | 96 | 0.61 |
22 | o3-pro o3-pro | OpenAI | 40.7 | — | — | $35 | 36.9 | 74.73 |
23 | Qwen3 Max Thinking qwen3-max-thinking | Alibaba | 39.7 | 30.5 | — | $2.4 | 40.2 | 2.02 |
24 | MiniMax-M2.1 minimax-m2-1 | MiniMax | 39.5 | 32.8 | 82.7 | $0.525 | 68.1 | 1.45 |
25 | MiMo-V2-Flash (Reasoning) mimo-v2-flash-reasoning | Xiaomi | 39.2 | 31.8 | 96.3 | $0.15 | 156.6 | 1.16 |
Sorted by Artificial Analysis Intelligence Index.
Showing 25 models.
CC BY-NC 4.0·2026 © Dimitri POSTOLOV
RSS