NCM plays each Stockfish dev build 20,000 times against Stockfish 14. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
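As a rough illustration of how a win/loss/draw record maps to an Elo difference, the sketch below uses the conventional logistic conversion. It is an assumption that this matches the "Standard Elo" column; the page does not state NCM's exact method, and the function name `elo_from_wld` is hypothetical.

```python
import math


def elo_from_wld(wins: int, losses: int, draws: int) -> float:
    """Estimate the Elo difference from a W/L/D record (logistic model).

    Assumption: the standard conversion elo = -400 * log10(1/score - 1),
    where score is the fraction of points won (a draw counts as half a point).
    """
    games = wins + losses + draws
    if games == 0:
        raise ValueError("no games played")
    score = (wins + 0.5 * draws) / games
    if score <= 0.0 or score >= 1.0:
        raise ValueError("Elo difference is unbounded for a 0% or 100% score")
    return -400.0 * math.log10(1.0 / score - 1.0)


if __name__ == "__main__":
    # Made-up 20,000-game record, for illustration only (not NCM data).
    print(round(elo_from_wld(5200, 4800, 10000), 2))
```

With the made-up record above (a 51% score), the estimate comes out at roughly +7 Elo. The larger the number of games, the narrower the uncertainty around such an estimate, which is why each build is played 20,000 times.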


Host | Duration | Avg Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo

Test Detail

ID | Host | Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | CLI | PGN


Commit ID 8a9c298deeee372251d95c867f877a7ac3a7c3fb
Author SFisGOD
Date 2018-11-19 09:02:31 UTC
Rook PSQT Tuned

Failed STC (Yellow)
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 56302 W: 12007 L: 11953 D: 32342

Passed 1st LTC (Green)
LLR: 2.95 (-2.94,2.94) [0.00,4.00]
Total: 8745 W: 1480 L: 1301 D: 5964

Failed 2nd LTC (Red)
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 19398 W: 3040 L: 3133 D: 13225

Passed 3rd LTC (Green)
LLR: 2.96 (-2.94,2.94) [0.00,4.00]
Total: 107516 W: 17342 L: 16858 D: 73316

Closes

How to continue from there?
The values in the rook table now look a bit strange to the human eye and are hard to explain. Maybe it would be nice to simplify them by hand and see if we can pass another (clean) double green with a more regular array.

Bench: 3188070
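The four test results in the commit message can be sanity-checked with a short script. The sketch below parses each `Total: ... W: ... L: ... D: ...` line, confirms that wins, losses, and draws add up to the total, and derives a rough Elo estimate using the conventional logistic formula. The helper names are hypothetical, and the Elo figure is a plain point estimate, not the LLR statistic that the test framework reports.

```python
import math
import re

# Result lines copied from the commit message above.
RESULTS = {
    "STC": "Total: 56302 W: 12007 L: 11953 D: 32342",
    "1st LTC": "Total: 8745 W: 1480 L: 1301 D: 5964",
    "2nd LTC": "Total: 19398 W: 3040 L: 3133 D: 13225",
    "3rd LTC": "Total: 107516 W: 17342 L: 16858 D: 73316",
}

RESULT_RE = re.compile(r"Total:\s*(\d+)\s+W:\s*(\d+)\s+L:\s*(\d+)\s+D:\s*(\d+)")


def parse_result(line: str) -> tuple[int, int, int]:
    """Return (wins, losses, draws); raise if the counts do not sum to Total."""
    match = RESULT_RE.search(line)
    if match is None:
        raise ValueError(f"unrecognised result line: {line!r}")
    total, wins, losses, draws = map(int, match.groups())
    if wins + losses + draws != total:
        raise ValueError(f"W+L+D does not equal Total in: {line!r}")
    return wins, losses, draws


def elo_estimate(wins: int, losses: int, draws: int) -> float:
    """Logistic Elo point estimate (an assumption, not the framework's LLR)."""
    score = (wins + 0.5 * draws) / (wins + losses + draws)
    return -400.0 * math.log10(1.0 / score - 1.0)


if __name__ == "__main__":
    for name, line in RESULTS.items():
        w, l, d = parse_result(line)
        print(f"{name}: {elo_estimate(w, l, d):+.2f} Elo (point estimate)")
```

The sign of each estimate follows the W/L balance: the failed 2nd LTC has more losses than wins and so yields a negative estimate, while the other three runs are positive.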
