比较和追踪最新 AI 模型性能排名

AI 排行榜

构建时从 Artificial Analysis 获取的模型评估和定价快照。

#1 Claude Opus 4.8 (Adaptive Reasoning, Max Effort) · 智能 61.4

排名模型创建者智能编程数学混合 $/1MTok/sTTFT (秒)
1最佳
Claude Opus 4.8 (Adaptive Reasoning, Max Effort)
claude-opus-4-8
Anthropic61.456.7$10.93859.812.48
2
GPT-5.5 (xhigh)
gpt-5-5
OpenAI60.259.1$11.2569.537.98
3
GPT-5.5 (high)
gpt-5-5-high
OpenAI58.958.5$11.2569.917.38
4
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
claude-opus-4-7
Anthropic57.352.5$10.93859.121.03
5
Gemini 3.1 Pro Preview
gemini-3-1-pro-preview
Google57.255.5$4.5135.922.92
6
GPT-5.4 (xhigh)
gpt-5-4
OpenAI56.857.2$5.62581.8163.99
7
GPT-5.5 (medium)
gpt-5-5-medium
OpenAI56.756.2$11.2567.54.84
8
Qwen3.7 Max
qwen3-7-max
Alibaba56.650.1$3.75204.61.68
9
Gemini 3.5 Flash (high)
gemini-3-5-flash
Google55.345$3.375226.313.2
10
Gemini 3.5 Flash (medium)
gemini-3-5-flash-medium
Google54.843.9$3.375212.611.43
11
Kimi K2.6
kimi-k2-6
Kimi53.947.1$1.71239.91.31
12
MiMo-V2.5-Pro
mimo-v2-5-pro
Xiaomi53.845.5$0.54453.21.93
13
GPT-5.3 Codex (xhigh)
gpt-5-3-codex
OpenAI53.653.1$4.81383.958.22
14
Grok 4.3 (high)
grok-4-3
xAI53.241$1.563130.123.5
15
Claude Opus 4.6 (Adaptive Reasoning, Max Effort)
claude-opus-4-6-adaptive
Anthropic52.948.1$10.93849.811.03
16
Muse Spark
muse-spark
Meta52.247.5$000
17
Claude Opus 4.7 (Non-reasoning, High Effort)
claude-opus-4-7-non-reasoning
Anthropic51.853.1$10.93849.51.1
18
Qwen3.6 Max Preview
qwen3-6-max
Alibaba51.844.9$2.92540.41.92
19
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)
claude-sonnet-4-6-adaptive
Anthropic51.750.9$6.56363.970.8
20
DeepSeek V4 Pro (Reasoning, Max Effort)
deepseek-v4-pro
DeepSeek51.547.5$0.54453.71.19
21
GLM-5.1 (Reasoning)
glm-5-1
Z AI51.443.4$2.1555.81.09
22
GPT-5.2 (xhigh)
gpt-5-2
OpenAI51.348.799$4.81380.397.55
23
GPT-5.5 (low)
gpt-5-5-low
OpenAI50.852.1$11.2564.11.82
24
Qwen3.6 Plus
qwen3-6-plus
Alibaba5042.9$1.12552.81.87
25
DeepSeek V4 Pro (Reasoning, High Effort)
deepseek-v4-pro-high
DeepSeek49.843.2$0.54455.21.17

按Artificial Analysis 智能指数排序。

显示 25 个模型。