NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
| Host | Duration | Avg Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo |
|---|---|---|---|---|---|---|---|
| ncm-dbt-01 | 00:21:47 | 585590 | 214 | 8 135 71 | -237.28 ± 28.89 | 29 69 9 0 0 | -542.97 ± 132.2 |
| ncm-dbt-02 | 00:21:05 | 584916 | 216 | 3 139 74 | -257.37 ± 28.14 | 32 72 4 0 0 | -689.62 ± 315.22 |
| ncm-dbt-03 | 00:22:13 | 585928 | 220 | 1 139 80 | -256.01 ± 26.57 | 31 76 3 0 0 | -743.62 ± 294.56 |
| ncm-dbt-04 | 00:22:13 | 568355 | 222 | 3 131 88 | -228.36 ± 22.77 | 22 84 5 0 0 | -654.93 ± 225.83 |
| ncm-dbt-05 | 00:22:12 | 583782 | 224 | 5 149 70 | -265.09 ± 30.77 | 38 68 6 0 0 | -624.06 ± 184.63 |
| 1096 | 20 693 383 | -248.55 ± 12.25 | 152 369 27 0 0 | -639.03 ± 68.64 | |||
| ID | Host | Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | CLI | PGN | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 462055 | ncm-dbt-02 | 584916 | 216 | 3 139 74 | -257.37 ± 28.14 | 32 72 4 0 0 | -689.62 ± 315.22 | ||||
cutechess-cli \
-rounds 266 \
-games 2 \
-concurrency 16 \
-srand 3116584454 \
-pgnout ncm-dbt-20200808-0635-005.pgn \
-openings \
file=UHO_4060_v2.epd \
format=epd \
order=random \
-repeat \
-resign \
movecount=3 \
score=600 \
-draw \
movenumber=34 \
movecount=8 \
score=5 \
-each \
tc=30+0.3 \
timemargin=10000 \
proto=uci \
option.Hash=128 \
option.Threads=8 \
-engine \
name=20200808-0635 \
cmd=docker \
arg=run \
arg=-i \
arg=--rm \
arg=--entrypoint=/engine \
arg=dev_build:910f779eb1f432c3f90fc19c7824840e02cac837 \
-engine \
name=sf15 \
cmd=docker \
arg=run \
arg=-i \
arg=--rm \
arg=--entrypoint=/engine \
arg=stockfish:15
|
|||||||||||
| 462054 | ncm-dbt-04 | 568355 | 222 | 3 131 88 | -228.36 ± 22.77 | 22 84 5 0 0 | -654.93 ± 225.83 | ||||
cutechess-cli \
-rounds 266 \
-games 2 \
-concurrency 16 \
-srand 2511747940 \
-pgnout ncm-dbt-20200808-0635-004.pgn \
-openings \
file=UHO_4060_v2.epd \
format=epd \
order=random \
-repeat \
-resign \
movecount=3 \
score=600 \
-draw \
movenumber=34 \
movecount=8 \
score=5 \
-each \
tc=30+0.3 \
timemargin=10000 \
proto=uci \
option.Hash=128 \
option.Threads=8 \
-engine \
name=20200808-0635 \
cmd=docker \
arg=run \
arg=-i \
arg=--rm \
arg=--entrypoint=/engine \
arg=dev_build:910f779eb1f432c3f90fc19c7824840e02cac837 \
-engine \
name=sf15 \
cmd=docker \
arg=run \
arg=-i \
arg=--rm \
arg=--entrypoint=/engine \
arg=stockfish:15
|
|||||||||||
| 462053 | ncm-dbt-01 | 585590 | 214 | 8 135 71 | -237.28 ± 28.89 | 29 69 9 0 0 | -542.97 ± 132.2 | ||||
cutechess-cli \
-rounds 266 \
-games 2 \
-concurrency 16 \
-srand 525316835 \
-pgnout ncm-dbt-20200808-0635-003.pgn \
-openings \
file=UHO_4060_v2.epd \
format=epd \
order=random \
-repeat \
-resign \
movecount=3 \
score=600 \
-draw \
movenumber=34 \
movecount=8 \
score=5 \
-each \
tc=30+0.3 \
timemargin=10000 \
proto=uci \
option.Hash=128 \
option.Threads=8 \
-engine \
name=20200808-0635 \
cmd=docker \
arg=run \
arg=-i \
arg=--rm \
arg=--entrypoint=/engine \
arg=dev_build:910f779eb1f432c3f90fc19c7824840e02cac837 \
-engine \
name=sf15 \
cmd=docker \
arg=run \
arg=-i \
arg=--rm \
arg=--entrypoint=/engine \
arg=stockfish:15
|
|||||||||||
| 462052 | ncm-dbt-05 | 583782 | 224 | 5 149 70 | -265.09 ± 30.77 | 38 68 6 0 0 | -624.06 ± 184.63 | ||||
cutechess-cli \
-rounds 266 \
-games 2 \
-concurrency 16 \
-srand 182905689 \
-pgnout ncm-dbt-20200808-0635-002.pgn \
-openings \
file=UHO_4060_v2.epd \
format=epd \
order=random \
-repeat \
-resign \
movecount=3 \
score=600 \
-draw \
movenumber=34 \
movecount=8 \
score=5 \
-each \
tc=30+0.3 \
timemargin=10000 \
proto=uci \
option.Hash=128 \
option.Threads=8 \
-engine \
name=20200808-0635 \
cmd=docker \
arg=run \
arg=-i \
arg=--rm \
arg=--entrypoint=/engine \
arg=dev_build:910f779eb1f432c3f90fc19c7824840e02cac837 \
-engine \
name=sf15 \
cmd=docker \
arg=run \
arg=-i \
arg=--rm \
arg=--entrypoint=/engine \
arg=stockfish:15
|
|||||||||||
| 462051 | ncm-dbt-03 | 585928 | 220 | 1 139 80 | -256.01 ± 26.57 | 31 76 3 0 0 | -743.62 ± 294.56 | ||||
cutechess-cli \
-rounds 266 \
-games 2 \
-concurrency 16 \
-srand 1553126316 \
-pgnout ncm-dbt-20200808-0635-001.pgn \
-openings \
file=UHO_4060_v2.epd \
format=epd \
order=random \
-repeat \
-resign \
movecount=3 \
score=600 \
-draw \
movenumber=34 \
movecount=8 \
score=5 \
-each \
tc=30+0.3 \
timemargin=10000 \
proto=uci \
option.Hash=128 \
option.Threads=8 \
-engine \
name=20200808-0635 \
cmd=docker \
arg=run \
arg=-i \
arg=--rm \
arg=--entrypoint=/engine \
arg=dev_build:910f779eb1f432c3f90fc19c7824840e02cac837 \
-engine \
name=sf15 \
cmd=docker \
arg=run \
arg=-i \
arg=--rm \
arg=--entrypoint=/engine \
arg=stockfish:15
|
|||||||||||
| Commit ID | 910f779eb1f432c3f90fc19c7824840e02cac837 |
|---|---|
| Author | Vizvezdenec |
| Date | 2020-08-08 06:35:47 UTC |
|
Do more futility pruning for parent nodes.
This patch increases LMRdepth threshold for futility pruning at parent nodes so it can apply more often.
With radical change to evaluation approach it seems that search is really far from optimal state, especially it parts that use static evaluation of position.
passed STC
https://tests.stockfishchess.org/tests/view/5f2da75661e3b6af64881fd0
LLR: 2.93 (-2.94,2.94) {-0.50,1.50}
Total: 8744 W: 1305 L: 1156 D: 6283
Ptnml(0-2): 75, 789, 2500, 928, 80
passed LTC
https://tests.stockfishchess.org/tests/view/5f2dcb2a61e3b6af64881ff3
LLR: 2.98 (-2.94,2.94) {0.25,1.75}
Total: 17728 W: 1256 L: 1117 D: 15355
Ptnml(0-2): 22, 961, 6774, 1070, 37
Bench: 4067325
|
|