Dev Builds » 20230824-0611

Use this dev build

NCM plays each Stockfish dev build 20,000 times against Stockfish 14. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

Host Duration Avg Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo
ncm-dbt-01 09:56:38 1135961 3344 1463 279 1602 +128.58 +/- 5.51 0 49 448 1117 58 +283.86 +/- 16.11
ncm-dbt-02 09:52:28 1197053 3310 1444 282 1584 +127.39 +/- 5.22 0 27 486 1095 47 +284.03 +/- 15.42
ncm-dbt-03 09:56:36 1228103 3346 1442 284 1620 +125.42 +/- 5.39 1 40 482 1100 50 +277.23 +/- 15.53
ncm-dbt-04 09:56:42 1237681 3348 1431 292 1625 +123.1 +/- 5.46 0 45 500 1074 55 +267.9 +/- 15.24
ncm-dbt-05 09:52:23 1224036 3310 1440 281 1589 +127.03 +/- 5.32 1 35 469 1104 46 +283.65 +/- 15.74
ncm-dbt-06 09:56:17 1217458 3342 1433 288 1621 +124.05 +/- 5.24 1 35 492 1104 39 +276.98 +/- 15.35
20000 8653 1706 9641 +125.92 +/- 2.19 3 231 2877 6594 295 +278.86 +/- 6.35

Test Detail

ID Host Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo CLI PGN
200770 ncm-dbt-02 1198391 310 139 27 144 +131.46 +/- 16.98 0 2 44 104 5 +294.82 +/- 52.08
200769 ncm-dbt-05 1217201 310 136 22 152 +134.04 +/- 16.9 0 2 42 106 5 +303.54 +/- 53.43
200768 ncm-dbt-01 1140702 344 154 20 170 +142.88 +/- 17.11 0 3 41 119 9 +320.26 +/- 54.29
200767 ncm-dbt-06 1238779 342 144 27 171 +123.85 +/- 15.13 0 1 55 112 3 +279.59 +/- 45.95
200766 ncm-dbt-03 1217240 346 143 37 166 +109.97 +/- 16.13 0 4 62 104 3 +238.32 +/- 43.44
200765 ncm-dbt-04 1243845 348 150 28 170 +127.19 +/- 17.08 0 6 45 118 5 +283.21 +/- 51.55
200764 ncm-dbt-02 1198149 500 219 39 242 +130.94 +/- 12.56 0 2 71 172 5 +301.33 +/- 40.6
200763 ncm-dbt-05 1211164 500 225 47 228 +129.35 +/- 14.75 1 8 61 172 8 +290.66 +/- 44.0
200762 ncm-dbt-06 1200808 500 207 39 254 +121.45 +/- 12.58 0 4 76 168 2 +277.93 +/- 39.29
200761 ncm-dbt-01 1136685 500 226 36 238 +138.99 +/- 14.39 0 7 57 175 11 +312.48 +/- 45.66
200760 ncm-dbt-03 1221084 500 220 45 235 +126.97 +/- 13.94 0 7 68 168 7 +282.94 +/- 41.76
200759 ncm-dbt-04 1219901 500 226 44 230 +132.54 +/- 13.83 0 5 67 169 9 +295.94 +/- 42.08
200758 ncm-dbt-02 1195073 500 220 50 230 +123.02 +/- 13.82 0 3 84 153 10 +263.42 +/- 37.11
200757 ncm-dbt-05 1233175 500 217 45 238 +124.6 +/- 14.47 0 8 71 162 9 +270.57 +/- 40.83
200756 ncm-dbt-06 1209132 500 235 46 219 +138.18 +/- 13.87 0 3 67 168 12 +306.84 +/- 41.99
200755 ncm-dbt-03 1239240 500 207 43 250 +118.33 +/- 13.86 0 7 78 159 6 +258.75 +/- 38.88
200754 ncm-dbt-04 1227250 500 218 36 246 +132.54 +/- 13.65 0 4 69 168 9 +295.94 +/- 41.39
200753 ncm-dbt-01 1130102 500 213 44 243 +122.24 +/- 13.66 0 8 69 169 4 +275.45 +/- 41.43
200752 ncm-dbt-05 1221462 500 220 41 239 +130.14 +/- 13.16 0 5 66 174 5 +298.62 +/- 42.41
200751 ncm-dbt-02 1193530 500 214 48 238 +119.89 +/- 14.02 0 8 74 162 6 +263.42 +/- 39.97
200750 ncm-dbt-06 1209275 500 215 49 236 +119.89 +/- 14.51 0 8 77 156 9 +256.44 +/- 39.15
200749 ncm-dbt-03 1238534 500 225 46 229 +130.15 +/- 14.4 1 4 70 165 10 +288.06 +/- 41.11
200748 ncm-dbt-04 1225778 500 221 43 236 +129.35 +/- 14.58 0 6 72 160 12 +277.93 +/- 40.53
200747 ncm-dbt-01 1138782 500 216 33 251 +133.34 +/- 14.34 0 6 66 167 11 +293.29 +/- 42.41
200746 ncm-dbt-02 1194172 500 210 53 237 +112.91 +/- 14.05 0 6 89 147 8 +238.66 +/- 36.16
200745 ncm-dbt-05 1238614 500 209 45 246 +118.33 +/- 13.35 0 4 84 156 6 +258.75 +/- 37.2
200744 ncm-dbt-06 1221472 500 209 45 246 +118.33 +/- 12.81 0 5 78 165 2 +268.17 +/- 38.8
200743 ncm-dbt-04 1243813 500 211 54 235 +112.91 +/- 13.56 0 7 83 156 4 +247.41 +/- 37.61
200742 ncm-dbt-03 1229606 500 217 36 247 +131.74 +/- 14.2 0 6 67 167 10 +290.66 +/- 42.08
200741 ncm-dbt-01 1131372 500 224 46 230 +129.35 +/- 14.24 0 9 61 173 7 +290.66 +/- 44.0
200740 ncm-dbt-05 1228312 500 211 33 256 +129.35 +/- 12.21 0 1 74 171 4 +298.62 +/- 39.56
200739 ncm-dbt-02 1194652 500 217 34 249 +133.34 +/- 13.26 0 4 66 173 7 +304.07 +/- 42.38
200738 ncm-dbt-04 1255765 500 218 43 239 +126.97 +/- 13.94 0 5 74 162 9 +277.93 +/- 39.92
200737 ncm-dbt-06 1218963 500 208 43 249 +119.11 +/- 14.03 1 7 72 166 4 +268.17 +/- 40.54
200736 ncm-dbt-01 1135925 500 208 46 246 +116.77 +/- 13.71 0 9 73 165 3 +261.07 +/- 40.24
200735 ncm-dbt-03 1223956 500 210 34 256 +127.76 +/- 13.57 0 6 68 170 6 +288.06 +/- 41.76
200734 ncm-dbt-05 1218325 500 222 48 230 +126.17 +/- 14.29 0 7 71 163 9 +275.45 +/- 40.84
200733 ncm-dbt-02 1205404 500 225 31 244 +142.25 +/- 12.37 0 2 58 184 6 +339.63 +/- 45.29
200732 ncm-dbt-06 1223777 500 215 39 246 +127.76 +/- 13.92 0 7 67 169 7 +285.49 +/- 42.08
200731 ncm-dbt-01 1138165 500 222 54 224 +121.45 +/- 14.98 0 7 81 149 13 +251.89 +/- 38.11
200730 ncm-dbt-03 1227065 500 220 43 237 +128.55 +/- 13.91 0 6 69 167 8 +285.49 +/- 41.44
200729 ncm-dbt-04 1247416 500 187 44 269 +102.22 +/- 14.78 0 12 90 141 7 +211.87 +/- 36.13

Commit

Commit ID 4c4cb185aaaa0b3175ca35ab6473f17e9ec64055
Author Stéphane Nicolet
Date 2023-08-24 06:11:17 UTC
Play turbulent when defending, simpler when attacking This patch decays a little the evaluation (up to a few percent) for positions which have a large complexity measure (material imbalance, positional compensations, etc). This may have nice consequences on the playing style, as it modifies the search differently for attack and defense, both effects being desirable: - to see the effect on positions when Stockfish is defending, let us suppose for instance that the side to move is Stockfish and the nnue evaluation on the principal variation is -100 : this patch will decay positions with an evaluation of -103 (say) to the same level, provided they have huge material imbalance or huge positional compensation. In other words, chaotic positions with an evaluation of -103 are now comparable in our search tree to stable positions with an evaluation of -100, and chaotic positions with an evaluation of -102 are now preferred to stable positions with an evaluation of -100. - the effect on positions when Stockfish is attacking is the opposite. Let us suppose for instance that the side to move is Stockfish and the nnue evaluation on the principal variation is +100 : this patch will decay the evaluation to +97 if the positions on the principal variation have huge material imbalance or huge positional compensation. In other words, stable positions with an evaluation of +97 are now comparable in our search tree to chaotic positions with an evaluation of +100, and stable positions with an evaluation of +98 are now preferred to chaotic positions with an evaluation of +100. So the effect of this small change of evaluation on the playing style is that Stockfish should now play a little bit more turbulent when defending, and choose slightly simpler lines when attacking. passed STC: LLR: 2.93 (-2.94,2.94) <0.00,2.00> Total: 268448 W: 68713 L: 68055 D: 131680 Ptnml(0-2): 856, 31514, 68943, 31938, 973 https://tests.stockfishchess.org/tests/view/64e252bb99700912526653ed passed LTC: LLR: 2.94 (-2.94,2.94) <0.50,2.50> Total: 141060 W: 36066 L: 35537 D: 69457 Ptnml(0-2): 71, 15179, 39522, 15666, 92 https://tests.stockfishchess.org/tests/view/64e4447a9009777747553725 closes https://github.com/official-stockfish/Stockfish/pull/4762 Bench: 1426295
Copyright 2011–2024 Next Chess Move LLC