by thaumasiotes 456 days ago
> Also important was the use [in Go] of learning by self play to learn a value function

I thought the self-play itself was the value function that made progress in Go. That is, it wasn't that we played through a lot of games and used that data to fit a function that assigns a value to a Go board. Instead, the function that assigns a value to a Go board plays out some games from that position and assigns a value based on the outcomes.
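
To make the second idea concrete, here is a rough sketch of "evaluate a position by playing it out". It uses a toy Nim game instead of Go, and everything in it (the rules, `rollout_value`, the playout count, uniformly random moves) is an assumption for illustration, not a description of any real Go program.

```python
import random


def rollout_value(stones, n_playouts=2000, rng=None):
    """Estimate a position's value for the player to move via random playouts.

    Toy rules (assumed): players alternate removing 1-3 stones, and whoever
    takes the last stone wins. Nothing is learned ahead of time; the value
    comes entirely from games played out from this position.
    """
    rng = rng or random.Random(0)  # fixed seed so the sketch is repeatable
    wins = 0
    for _ in range(n_playouts):
        remaining = stones
        mover_is_us = True    # "us" = the player to move in the evaluated position
        winner_is_us = False  # with no stones left, the player to move has already lost
        while remaining > 0:
            take = rng.randint(1, min(3, remaining))  # uniformly random legal move
            remaining -= take
            if remaining == 0:
                winner_is_us = mover_is_us
            mover_is_us = not mover_is_us
        wins += winner_is_us
    return wins / n_playouts  # fraction of playouts won by the player to move


if __name__ == "__main__":
    for n in range(0, 6):
        print(n, rollout_value(n))
```

The contrast with a learned value function is that nothing here is fit to data in advance: each call spends its compute playing games from the given position and averages the results.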