Dev Builds » 20240505-1104

You are viewing an old NCM Stockfish dev build test. You may find the most recent dev build tests using Stockfish 15 as the baseline here.

Use this dev build

NCM plays each Stockfish dev build 20,000 times against Stockfish 14. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

Host Duration Avg Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo
ncm-dbt-01 11:45:12 1131543 4010 1744 327 1939 +128.3 +/- 4.91 1 51 541 1354 58 +286.97 +/- 14.66
ncm-dbt-02 11:44:50 1197269 3992 1780 292 1920 +136.06 +/- 4.72 0 33 503 1399 61 +311.72 +/- 15.2
ncm-dbt-03 11:42:41 1237224 4000 1751 285 1964 +133.54 +/- 4.88 0 47 505 1383 65 +301.67 +/- 15.17
ncm-dbt-05 11:44:05 1196787 3998 1781 306 1911 +134.52 +/- 4.85 0 43 504 1386 66 +304.66 +/- 15.19
ncm-dbt-06 11:44:40 1229306 4000 1776 275 1949 +137.07 +/- 4.81 1 35 495 1400 69 +312.84 +/- 15.33
20000 8832 1485 9683 +133.89 +/- 2.16 2 209 2548 6922 319 +303.38 +/- 6.75

Test Detail

ID Host Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo CLI PGN
366921 ncm-dbt-01 1125720 10 5 0 5 +190.62 +/- 11.11 0 0 0 5 0 +1199.83 +/- 231.31
366920 ncm-dbt-02 1192468 492 215 43 234 +126.81 +/- 13.36 0 5 69 167 5 +287.33 +/- 41.42
366919 ncm-dbt-05 1194913 498 219 42 237 +129.12 +/- 13.4 0 6 65 173 5 +295.12 +/- 42.75
366918 ncm-dbt-06 1252143 500 221 26 253 +143.07 +/- 13.15 1 2 55 185 7 +342.85 +/- 46.64
366917 ncm-dbt-03 1241585 500 222 32 246 +138.99 +/- 13.29 0 4 60 178 8 +321.19 +/- 44.57
366916 ncm-dbt-01 1123816 500 229 48 223 +131.74 +/- 13.85 0 3 74 162 11 +288.06 +/- 39.79
366915 ncm-dbt-02 1198497 500 228 28 244 +147.19 +/- 13.18 0 4 51 186 9 +349.43 +/- 48.52
366914 ncm-dbt-06 1209655 500 231 28 241 +149.68 +/- 13.27 0 2 55 181 12 +349.43 +/- 46.59
366913 ncm-dbt-03 1234180 500 212 36 252 +127.76 +/- 13.02 0 2 77 164 7 +285.49 +/- 38.83
366912 ncm-dbt-05 1196489 500 229 43 228 +135.76 +/- 15.31 0 10 57 170 13 +295.94 +/- 45.36
366911 ncm-dbt-01 1138496 500 218 40 242 +129.35 +/- 13.9 0 6 68 168 8 +288.06 +/- 41.76
366910 ncm-dbt-06 1230175 500 218 30 252 +137.37 +/- 14.25 0 7 58 175 10 +309.64 +/- 45.27
366909 ncm-dbt-05 1200180 500 229 45 226 +134.15 +/- 13.97 0 6 63 172 9 +301.33 +/- 43.45
366908 ncm-dbt-03 1235241 500 222 31 247 +139.81 +/- 13.45 0 3 63 174 10 +318.25 +/- 43.39
366907 ncm-dbt-02 1194852 500 222 34 244 +137.37 +/- 13.34 0 4 62 176 8 +315.35 +/- 43.81
366906 ncm-dbt-01 1131680 500 215 42 243 +125.38 +/- 13.79 0 6 72 165 7 +277.93 +/- 40.53
366819 ncm-dbt-02 1190677 500 216 41 243 +126.97 +/- 13.04 0 4 72 169 5 +288.06 +/- 40.46
366818 ncm-dbt-05 1194193 500 213 34 253 +130.14 +/- 13.34 0 7 61 178 4 +301.33 +/- 44.13
366817 ncm-dbt-01 1140571 500 213 36 251 +128.55 +/- 13.73 0 6 68 169 7 +288.06 +/- 41.76
366816 ncm-dbt-06 1228991 500 222 46 232 +127.76 +/- 14.1 0 6 71 164 9 +280.42 +/- 40.83
366815 ncm-dbt-03 1237499 500 224 32 244 +140.62 +/- 14.35 0 9 49 183 9 +324.17 +/- 48.88
366814 ncm-dbt-02 1194133 500 227 43 230 +134.15 +/- 13.8 0 6 62 174 8 +304.07 +/- 43.81
366813 ncm-dbt-05 1193766 500 221 32 247 +138.18 +/- 12.93 0 4 59 181 6 +324.17 +/- 44.97
366812 ncm-dbt-01 1115376 500 224 51 225 +125.39 +/- 15.27 1 8 69 161 11 +270.57 +/- 41.4
366811 ncm-dbt-06 1219676 500 215 37 248 +129.35 +/- 13.9 0 4 74 162 10 +282.94 +/- 39.86
366810 ncm-dbt-03 1221937 500 221 47 232 +126.17 +/- 14.29 0 8 68 166 8 +277.93 +/- 41.74
366809 ncm-dbt-02 1197476 500 227 37 236 +138.99 +/- 12.7 0 2 63 178 7 +324.17 +/- 43.32
366808 ncm-dbt-05 1196877 500 233 31 236 +148.85 +/- 13.51 0 3 54 181 12 +346.12 +/- 47.1
366807 ncm-dbt-01 1126589 500 212 40 248 +124.6 +/- 12.9 0 5 71 171 3 +285.49 +/- 40.81
366806 ncm-dbt-06 1235468 500 225 41 234 +134.15 +/- 13.61 0 5 64 173 8 +304.07 +/- 43.1
366805 ncm-dbt-03 1239168 500 213 34 253 +130.14 +/- 14.74 0 10 60 171 9 +288.06 +/- 44.27
366804 ncm-dbt-02 1198609 500 227 33 240 +142.25 +/- 13.75 0 5 56 179 10 +327.18 +/- 46.19
366803 ncm-dbt-05 1198372 500 217 38 245 +130.14 +/- 14.4 0 5 73 160 12 +280.42 +/- 40.21
366802 ncm-dbt-01 1139154 500 216 32 252 +134.15 +/- 14.15 0 10 52 182 6 +309.64 +/- 47.37
366801 ncm-dbt-06 1210507 500 220 35 245 +134.95 +/- 13.22 0 5 61 178 6 +312.48 +/- 44.19
366800 ncm-dbt-03 1233675 500 221 43 236 +129.35 +/- 13.72 0 4 73 164 9 +285.49 +/- 40.16
366799 ncm-dbt-02 1211443 500 218 33 249 +134.95 +/- 13.41 0 3 68 170 9 +304.07 +/- 41.65
366798 ncm-dbt-05 1199512 500 220 41 239 +130.14 +/- 12.59 0 2 72 171 5 +298.62 +/- 40.29
366797 ncm-dbt-06 1247838 500 224 32 244 +140.62 +/- 13.04 0 4 57 182 7 +330.23 +/- 45.79
366796 ncm-dbt-01 1142488 500 212 38 250 +126.17 +/- 13.6 0 7 67 171 5 +285.49 +/- 42.08
366795 ncm-dbt-03 1254509 500 216 30 254 +135.76 +/- 13.39 0 7 55 183 5 +318.25 +/- 46.48

Commit

Commit ID 6da1590de0980ca569827e2905f5b423e1a00a52
Author cj5716
Date 2024-05-05 11:04:37 UTC
Some history fixes and tidy-up This adds the functions `update_refutations` and `update_quiet_histories` to better distinguish the two. `update_quiet_stats` now just calls both of these functions. The functional side of this patch is two-fold: 1. Stop refutations being updated when we carry out multicut 2. Update pawn history every time we update other quiet histories Yellow STC: LLR: -2.95 (-2.94,2.94) <0.00,2.00> Total: 238976 W: 61506 L: 61415 D: 116055 Ptnml(0-2): 846, 28628, 60456, 28705, 853 https://tests.stockfishchess.org/tests/view/66321b5ed01fb9ac9bcdca83 However, it passed in <-1.75, 0.25> bounds: $ python3 sprt.py --wins 61506 --losses 61415 --draws 116055 --elo0 -1.75 --elo1 0.25 ELO: 0.132 +- 0.998 [-0.865, 1.13] LLR: 4.15 [-1.75, 0.25] (-2.94, 2.94) H1 Accepted Passed LTC: LLR: 2.94 (-2.94,2.94) <-1.75,0.25> Total: 399126 W: 100730 L: 100896 D: 197500 Ptnml(0-2): 116, 44328, 110843, 44158, 118 https://tests.stockfishchess.org/tests/view/66357b0473559a8aa857ba6f closes #5215 Bench 2370967
Copyright 2011–2024 Next Chess Move LLC