Dev Builds » 20140203-2037

You are viewing an old NCM Stockfish dev build test. You may find the most recent dev build tests using Stockfish 15 as the baseline here.

Use this dev build

NCM plays each Stockfish dev build 20,000 times against Stockfish 14. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

Host	Duration	Avg Base NPS	Games	WLD	Standard Elo	Ptnml(0-2)	Gamepair Elo

Test Detail

ID	Host	Base NPS	Games	WLD	Standard Elo	Ptnml(0-2)	Gamepair Elo	CLI	PGN

Commit

Commit ID	e88ef801af3ac5400c1b82d009666fb082c24a16
Author	Lucas Braesch
Date	2014-02-03 20:37:14 UTC
Better document razoring Use ralpha instead of rbeta * rbeta is confusing people. It took THREE attempts to code razoring at PV nodes correctly in a recent test, because of the rbeta trick. Unnecessary tricks should be avoided. * The more correct and self-documenting way of doing this, is to say that we use a zero window around alpha-margin, not beta-margin. The fact that, because we only do it at PV nodes, alpha happens to be beta-1 and that the current stuff with rbeta works, may be correct, but is confusing. Remove the misleading and partially erroneous comment about returning v + margin: * comments should explain what the code does, not what it could have done. * this comment is partially wrong in saying that v+margin is "logical", and that it is "surprising" that is doesn't work. From a theoretical perspective, at least 3 ways of doing this are equally defendable: 1/ fail hard: return alpha: The most conservative. We bet that the search will fail low, but we don't know by how much and don't want to take risks. 2/ aggressive fail soft: return v (what the current code does). This corresponds to normal fail soft, with the added assumption that we don't care about the reduction effect (see below point 3/) 3/ conservative fail soft: return v + margin. If the reduced search (qsearch) gives us a score <= v, we bet that the non reduced search will give us a score <= v + margin. * Saying that 2/ is "logical" implies that 1/ and 3/ are not, which is arguably wrong. Besides, experimental results tell us that 2/ beats 3/, and that's not something we can argue against: experimental results are the only trusted metric. * Also, with the benefit of hindsight, I don't think the fact that 2/ is better than 3/ is surprising at all. The point is that it is YOUR turn to move, and you are assuming that by NOT playing (and letting the opponent capture your hanging pieces in QS) you cannot generally GAIN razor_margin(depth). No functional change.