| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by thisismyswamp 966 days ago
	Playing chess & go is also search in a large tree of moves leading to particular game states

2 comments

pxeger1 966 days ago

But AlphaGo etc don’t use any kind of language-based AI, so LLMs (which this thread was about) are no good.

link

thisismyswamp 966 days ago

The next step seems to be applying past advances in reinforcement learning with modern transformer based models

link

mattsan 966 days ago

Which multiple teams are working on - OpenAI (Q*), and Meta just released a reinforcement learning framework

link

npsomaratna 966 days ago

Could you point me towards Meta's reinforcement learning framework? I'd like to see how it stacks up against the OpenAI gym.

link

mattsan 965 days ago

Sure thing - https://pearlagent.github.io/

HN post here: https://news.ycombinator.com/item?id=38564526

link

npsomaratna 965 days ago

Thank you!

link

greysphere 966 days ago

The final state in chess is a single* state which yes, then branches out to N checkmate configurations and then N*M one-move-from-checkmates, and so on. (*Technically it's won/lost/draw.)

The equivalent final state in theorem proving is unique to each theorem so such a system would need to handle an additional layer-of-generalization.

link

ChainOfFools 966 days ago

Is this how some of the more advanced chess engines work, or even the not so advanced ones, where there's a point at which it stops searching the forward move tree in greatest depth, and instead starts searching backwards from a handful of plausible (gross move limit-bound) checkmate states looking for an intersection with a shallow forward search state?

link

zone411 966 days ago

Kind of, but it's calculated offline and then just accessed during the game: https://www.chessprogramming.org/Endgame_Tablebases

link