| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by Sol2Sol 979 days ago

Been following the story, mostly on twitter don't know if any of this is true but at this point it is just pure entertainment reading some of the speculative theories:

https://twitter.com/YourFirstAI/status/1727661862781612215

If Q* is what I think it is… this is absolutely BONKERS

Q-learning has existed for decades already. It’s just a basic reinforcement learning algorithm. A* is also fairly old- it’s a heuristic-based path finding algorithm.

In typical engineering fashion, they may have found an intersection of the 2 and named it Q*. This is total speculation, but if this is a “breakthrough” that means OAI built an algorithm that can feed a highly efficient heuristic into Q-learning. That is MASSIVE.

1 comments

wegfawefgawefg 979 days ago

to tone down your enthusiasm a bit, heuristic tree search and q learning were already mixed years ago. Thats what Alphago and Alphazero were.

link

gmerc 979 days ago

Which Ilya was involved with...

link