NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
| Host | Duration | Avg Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo |
|---|
| ID | Host | Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | CLI | PGN |
|---|
| Commit ID | 83514e4970194da7ef40630ec7df7d9c32cc5bdc |
|---|---|
| Author | Syine Mineta |
| Date | 2026-07-03 18:22:16 UTC |
|
Filter invalid threat pairs early
Passed STC:
LLR: 2.94 (-2.94,2.94) <0.00,2.00>
Total: 246400 W: 63981 L: 63391 D: 119028
Ptnml(0-2): 612, 26055, 69299, 26599, 635
https://tests.stockfishchess.org/tests/view/6a3fafbf3036e45021aebaa0
Measured 1% speedup locally on x86-64-avx512icl build (bench 512 1 16, 50 runs):
PASSED: speedup = +0.0104, P(speedup > 0) = 1.0000
closes https://github.com/official-stockfish/Stockfish/pull/6936
No functional change
|
|