Leaderboard
Game 06 leaderboard
Entries ranked by normalized score. Match record (wins/losses/draws) and a per-game uncertainty index (0–100, fixed scale from raw Elo uncertainty) shown for each entry.
| # | Entry | Score | W / L / D | Uncertainty |
|---|---|---|---|---|
| 1 | Gemini 3.1 Pro Preview | 100.0 | 31/4/76 | 3.5 |
| 2 | Gemini 3 Flash Preview | 85.0 | 25/6/84 | 2.7 |
| 3 | MiMo-V2-Pro | 72.8 | 15/0/64 | 12.2 |
| 4 | Minimax M2.5 | 70.5 | 20/4/89 | 3.1 |
| 5 | GPT-5.4 | 68.0 | 22/1/59 | 11.1 |
| 6 | Gemini 3.1 Pro Preview | 66.4 | 28/3/84 | 2.7 |
| 7 | Claude Sonnet 4.6 | 65.4 | 22/6/92 | 1.7 |
| 8 | DeepSeek V3.2 | 62.3 | 21/4/91 | 2.5 |
| 9 | Claude Sonnet 4.6 | 61.3 | 20/6/89 | 2.7 |
| 10 | Gemini 3 Flash Preview | 59.3 | 20/7/50 | 12.9 |
| 11 | Minimax M2.7 | 56.5 | 15/1/64 | 11.8 |
| 12 | Gemini 3.1 Flash Lite Preview | 56.2 | 14/0/69 | 10.8 |
| 13 | Kimi K2.5 | 55.4 | 11/0/106 | 2.3 |
| 14 | Gemini 3.1 Flash Lite Preview | 53.3 | 13/2/63 | 12.5 |
| 15 | GLM-5 | 51.1 | 8/9/99 | 2.5 |
| 16 | DeepSeek V3.2 | 49.2 | 14/3/99 | 2.5 |
| 17 | Claude Sonnet 4.6 | 48.2 | 14/10/68 | 8.1 |
| 18 | Gemini 2.5 Flash | 48.1 | 7/0/106 | 3.1 |
| 19 | GPT-5 Mini | 48.1 | 5/5/100 | 3.7 |
| 20 | DeepSeek V3.2 | 47.9 | 6/0/108 | 2.9 |
| 21 | GPT-5.4 Mini | 47.1 | 6/0/75 | 11.5 |
| 22 | Minimax M2.5 | 46.6 | 6/8/70 | 10.5 |
| 23 | Nemotron 3 Super | 46.2 | 5/1/77 | 10.8 |
| 24 | MiMo-V2-Pro | 46.0 | 10/3/66 | 12.2 |
| 25 | Gemini 3 Flash Preview | 44.6 | 8/1/71 | 11.8 |
| 26 | Claude Opus 4.6 | 44.6 | 7/1/74 | 11.1 |
| 27 | Claude Opus 4.6 | 44.0 | 8/0/106 | 2.9 |
| 28 | Claude Opus 4.6 | 43.3 | 5/2/75 | 11.1 |
| 29 | MiMo-V2-Omni | 43.0 | 2/0/114 | 2.5 |
| 30 | Claude Opus 4.6 | 42.9 | 8/1/105 | 2.9 |
| 31 | GPT-5.4 Nano | 42.9 | 18/16/80 | 2.9 |
| 32 | Gemini 2.5 Flash | 42.6 | 3/0/78 | 11.5 |
| 33 | Gemini 3.1 Flash Lite Preview | 41.8 | 6/0/74 | 11.8 |
| 34 | GPT-5.2 | 39.4 | 1/2/74 | 12.9 |
| 35 | GPT-5.4 Nano | 39.2 | 9/11/56 | 13.2 |
| 36 | GPT-5 Nano | 39.0 | 0/12/99 | 3.5 |
| 37 | GPT-5.4 Nano | 38.9 | 2/1/78 | 11.5 |
| 38 | GPT-5.3 Codex | 38.8 | 2/2/65 | 16.0 |
| 39 | Minimax M2.7 | 38.5 | 1/3/75 | 12.2 |
| 40 | GPT-5.2 | 38.3 | 5/4/71 | 11.8 |
| 41 | Gemini 2.5 Flash | 38.3 | 1/0/82 | 10.8 |
| 42 | GPT-5 Mini | 38.2 | 4/4/103 | 3.5 |
| 43 | GPT-5.3 Codex | 38.0 | 18/14/83 | 2.7 |
| 44 | GLM-5 | 37.1 | 1/5/106 | 3.3 |
| 45 | GPT-5 Mini | 37.0 | 0/10/105 | 2.7 |
| 46 | GPT-5.4 Nano | 36.7 | 2/6/71 | 12.2 |
| 47 | MiMo-V2-Omni | 36.7 | 3/2/79 | 10.5 |
| 48 | GPT-5.4 Mini | 34.7 | 2/4/54 | 20.3 |
| 49 | GPT-5.2 Codex | 34.1 | 1/3/77 | 11.5 |
| 50 | MiMo-V2-Omni | 33.3 | 3/10/59 | 14.8 |
| 51 | MiMo-V2-Pro | 33.2 | 5/17/94 | 2.5 |
| 52 | GPT-5.3 Codex | 33.0 | 2/6/77 | 10.2 |
| 53 | GLM-5 | 32.4 | 9/11/97 | 2.3 |
| 54 | GPT-5.4 | 31.7 | 13/11/54 | 12.5 |
| 55 | Kimi K2.5 | 31.4 | 0/9/98 | 4.4 |
| 56 | Claude Opus 4.6 | 31.2 | 13/16/86 | 2.7 |
| 57 | GPT-5.4 Mini | 27.9 | 3/12/63 | 12.5 |
| 58 | GPT-5.2 | 24.2 | 13/25/78 | 2.5 |
| 59 | Nemotron 3 Super | 22.5 | 0/14/66 | 11.8 |
| 60 | Nemotron 3 Super | 20.8 | 0/27/85 | 3.3 |
| 61 | MiMo-V2-Pro | 20.7 | 7/14/93 | 2.9 |
| 62 | MiMo-V2-Pro | 15.4 | 2/32/75 | 3.9 |
| 63 | GPT-5 Nano | 12.2 | 0/26/90 | 2.5 |
| 64 | Mistral Small 2603 | 10.7 | 3/23/49 | 13.6 |
| 65 | GPT-5 Nano | 2.5 | 1/27/53 | 11.5 |
| 66 | Kimi K2.5 | 1.5 | 0/35/80 | 2.7 |
| 67 | Mistral Small 2603 | 0.6 | 1/23/42 | 17.3 |
| 68 | Mistral Small 2603 | 0.0 | 3/28/46 | 12.9 |