shBench

Project Vend
LogicBench
EGEBench
By score
| # | Vendor | Model | Score |
|---|--------|-------|-------|
| 16 | OpenAI | GPT 3.5 Turbo | 1 |
| 15 | OpenAI | ChatGPT 5 Instant | 2 |
| 15 | OpenAI | GPT 4.5 | 2 |
| 15 | YandexGPT | YandexGPT | 2 |
| 14 | Gemini | Gemma 3 27B | 3 |
| 14 | OpenAI | GPT OSS 20B | 3 |
| 14 | Kimi | Kimi K2 | 3 |
| 14 | Alibaba (inclusion) | Ling-1T | 3 |
| 14 | Alibaba (inclusion) | Ring-1T | 3 |
| 13 | GigaChat | GigaChat | 3.5 |
| 13 | OpenAI | GPT 4o | 3.5 |
| 13 | MiniMax | MiniMax M2 | 3.5 |
| 12 | DeepSeek | DeepSeek 3.2 Thinking | 4 |
| 11 | OpenAI | GPT 4.1 | 4.5 |
| 11 | OpenAI | GPT 5.1 Instant | 4.5 |
| 10 | Anthropic | Claude 4 Sonnet | 5 |
| 10 | Anthropic | Claude 4.5 Haiku | 5 |
| 10 | GLM | GLM-4.5 | 5 |
| 10 | OpenAI | GPT OSS 120B | 5 |
| 10 | Grok | Grok 4 Heavy | 5 |
| 10 | Gemini | Nano Banana | 5 |
| 10 | OpenAI | o4 Mini | 5 |
| 9 | Kimi | Kimi K2 Thinking | 5.5 |
| 9 | Mistral | Mistral LeChat | 5.5 |
| 9 | OpenAI | o1-pro (yupp.ai) | 5.5 |
| 9 | Qwen | Qwen 3 Max | 5.5 |
| 8 | GLM | GLM-4.6 | 6 |
| 8 | OpenAI | o4 Mini High | 6 |
| 7 | Anthropic | Claude 4.5 Sonnet | 6.5 |
| 7 | Gemini | Gemini 2.5 Flash (AI Studio) | 6.5 |
| 7 | Grok | Grok 4 Fast | 6.5 |
| 7 | Grok | Grok 4.1 | 6.5 |
| 7 | OpenAI | o3-pro (genspark.ai) | 6.5 |
| 7 | Qwen | Qwen 3 Max Thinking | 6.5 |
| 6 | Anthropic | Claude 4.1 Opus | 7 |
| 6 | Gemini | Gemini 2.5 Pro (AI Studio) | 7 |
| 6 | Gemini | Gemini Robotics-ER 1.5 Preview | 7 |
| 6 | Manus | Manus 1.5 | 7 |
| 6 | OpenAI | o3 | 7 |
| 5 | Baidu | ERNIE 5.0 Preview | 7.5 |
| 5 | OpenAI | GPT 5.1 Thinking | 7.5 |
| 5 | Grok | Grok 4 | 7.5 |
| 5 | Grok | Grok 4.1 Thinking | 7.5 |
| 4 | Anthropic | Claude 4.5 Opus | 8 |
| 4 | OpenAI | GPT-5 Thinking (Extended) | 8 |
| 3 | OpenAI | GPT-5.2 | 8.5 |
| 2 | Gemini | Gemini 2.5 Pro Deep Think | 9 |
| 2 | Gemini | Gemini 2.5 Pro Deep Think (@loxAIbot) | 9 |
| 2 | OpenAI | GPT-5 Pro | 9 |
| 1 | Gemini | Gemini 3 Flash | 10 |
| 1 | Gemini | Gemini 3 Pro | 10 |