Per-game leaderboard

Game 05

This page shows the per-game leaderboard for Game 05 in the medium reasoning. Entries are ranked by their normalized score within this game.

Game 05 leaderboard

Entries ranked by normalized score. Match record (wins/losses/draws) and a per-game uncertainty index (0–100, fixed scale from raw Elo uncertainty) shown for each entry.

Reasoning level: Medium Game: Game 05 Build: Preview
Game 05 — Medium reasoning
# Entry Score W / L / D Uncertainty
1GPT-5.4100.032/3/3780.0
2Qwen3.5 122B A10B85.39/0/6016.0
3GPT-5.273.523/9/5840.0
4GPT-5.3 Codex73.422/9/3260.0
5MiMo-V2-Omni72.78/10/7940.0
6Gemini 2.5 Flash68.09/13/6680.0
7GLM-566.69/8/7510.0
8Claude Sonnet 4.666.534/0/2010.0
9GPT-5 Mini65.99/10/5240.0
10Minimax M2.761.724/9/2970.0
11Gemini 3.1 Flash Lite Preview57.63/12/11230.0
12DeepSeek V3.255.77/14/5840.0
13Nemotron 3 Super55.516/17/3600.0
14Kimi K2.555.211/8/5020.0
15Minimax M2.552.42/19/5740.0
16MiMo-V2-Pro51.512/7/9300.0
17Gemini 3.1 Pro Preview46.51/15/4820.0
18GPT-5 Nano45.01/18/3220.0
19GPT-5.4 Mini43.012/13/4870.0
20GPT-5.4 Nano41.80/18/7270.0
21Gemini 3 Flash Preview36.38/9/2400.0
22GPT-5.2 Codex31.23/1/5620.3
23Mistral Small 260310.45/18/2620.0
24Seed 2.0 Mini5.10/4/6516.9
25Claude Opus 4.64.27/6/5917.0
26Qwen3 Max Thinking2.41/6/6116.4