Re: [math-fun] can't win by resigning
Now that Lee Sedol has won game 4 of the 5-game match (having already lost the match with yesterday's third game), it looks like AlphaGo does indeed need some fine-tuning in the end-game. When it's behind at the end it seems to try cheap tricks that might work on kyu-level players, and to make forcing moves that delay the inevitable even while losing a few more points. It eventually resigned - I don't know whether it's wired for that or whether its trainers threw its virtual towel into the ring. When it's ahead it makes what the commentator called "slack" moves - safe moves that take the pressure off the opponent.

When I ran my chess program in several human tournaments in Pittsburgh (I still have its USCF membership cards for a couple of years in the early '70s - that's before they stopped accepting computers), it would thrash around when it saw an inevitable mate coming, sacrificing material just to push the mate out another move. It looked bizarre. I resigned for it several times. Oh, the embarrassment. But the bug didn't seem worth fixing.

AlphaGo's play is interestingly different from humans' in another way: its evaluation ignores the margin of the win in order to maximize the probability of a win. This made the third game, for example, seem closer than it was - AlphaGo apparently had other aces up its conduits if it had run into trouble. Many players will attack anything attackable just to finish off the game. Bobby Fischer wasn't satisfied with winning his chess games - he felt he had to crush his opponents.
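A toy sketch of that evaluation difference (this is not DeepMind's code - the move names and numbers are invented for illustration): an agent maximizing win probability will prefer a near-certain small win over a risky crushing one, while a margin-maximizer picks the opposite.

```python
# Toy sketch (not AlphaGo's actual code): two ways to pick a move.
# Each candidate move carries a hypothetical win probability and an
# expected margin of victory in points - both numbers are invented.
moves = {
    "safe":  {"p_win": 0.95, "margin": 1.5},   # small but near-certain win
    "risky": {"p_win": 0.80, "margin": 20.0},  # big win, but more can go wrong
}

# Maximize probability of winning, as AlphaGo reportedly does:
by_probability = max(moves, key=lambda m: moves[m]["p_win"])

# Maximize expected winning margin, closer to how a "crusher" plays
# (losses contribute zero margin, so expected margin is p_win * margin):
by_margin = max(moves, key=lambda m: moves[m]["p_win"] * moves[m]["margin"])

print(by_probability)  # safe
print(by_margin)       # risky
```

With these invented numbers the probability-maximizer takes the 1.5-point win at 95%, which is exactly why its games can look closer than they are.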
From: Gareth McCaughan <gareth.mccaughan@pobox.com>
On 10/03/2016 17:19, Warren D Smith wrote:
Well, if software testers refused to try to find bugs due to "etiquette" then there would be a lot more bugs.
My guess as to the actual question here: once the game gets near the end, I suspect AlphaGo's tree search will produce extremely strong play even if its neural networks mess up. So even conditional on the scenario you describe where AlphaGo's training hasn't equipped it to evaluate things well in unusual positions, I think it's very unlikely that playing on would have given Lee Sedol a non-negligible extra chance of winning.
-- Jim Gillogly
It eventually resigned - I don't know whether it's wired for that
... yes it is. A commentator on the live YouTube coverage said that when AlphaGo estimates its chances of winning have dropped below 10%, it resigns. Best, É.
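As a minimal sketch of that rule (the 10% figure comes from the live commentary, not from any official documentation, and the function name is invented):

```python
# Reported resignation rule: resign once the engine's own estimate
# of its winning chances falls below a fixed threshold.
RESIGN_THRESHOLD = 0.10  # figure quoted by the live commentator

def should_resign(estimated_win_probability: float) -> bool:
    """Return True when the self-estimated win probability is below threshold."""
    return estimated_win_probability < RESIGN_THRESHOLD

print(should_resign(0.08))  # True
print(should_resign(0.30))  # False
```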
On 13 March 2016 at 10:16, Jim Gillogly <scryer@gmail.com> wrote:
It eventually resigned - I don't know whether it's wired for that
Is it known whether DeepMind had, or was trained on, a database of Lee Sedol's previous games? (If so, that seems kind of unfair, since he didn't have the same privilege.) —Dan
_______________________________________________ math-fun mailing list math-fun@mailman.xmission.com https://mailman.xmission.com/cgi-bin/mailman/listinfo/math-fun
On 2016-03-13 03:17, Dan Asimov wrote:
Is it known whether Deep Mind had or was trained on a database of Lee Sedol's previous games?
(If so, that seems kind of unfair, since he didn't have the same privilege.)
DeepMind's AlphaGo was trained on amateur games from an Internet go server (and then played "millions" of games against itself). It had seen no games of Lee Sedol. A reporter at the press conference asked exactly your question (including the question about the unfairness of the information mismatch, if it existed).
"unfairness"
... is a strange concept here. How could it be defined? What about turning AlphaGo overnight into a "prudent" or "weak" mode? I see two purposes: one is being "polite" towards Seoul, South Korea, and Asia in general - the DeepMind officials, in the press conference after game 3, were almost begging for mercy, apologizing for the victory and saying every three sentences how "fantastic" Lee Sedol was - and the second is research (how does AlphaGo compute and "behave" in real life against a human champion and under time pressure?). With the huge amounts of money, reputation, and so on at stake, "unfairness" is the last word I would use here. Bye, É.
Catapulted from my aPhone
On 13 March 2016 at 14:07, Michael Greenwald <mbgreen@seas.upenn.edu> wrote:
participants (4)
- Dan Asimov
- Eric Angelini
- Jim Gillogly
- Michael Greenwald