| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by romesc 934 days ago
	Sure A* is awesome, but taking the "star" and immediately attributing it to A* is probably a bridge too far. Q* or any X* for that matter is extremely common for referring to the optimal function under certain assumptions. (usually cost / reward structure).

1 comments

tunesmith 934 days ago

Yeah I just saw the video from that researcher (later an OpenAI researcher?) that talked about it back in 2016... not that I understood much, but it definitely seemed that Q* was a generalization of the Q algorithm described on the previous slide. The optimum something across all somethings.

link

resource0x 934 days ago

LeCun: Please ignore the deluge of complete nonsense about Q*. https://twitter.com/ylecun/status/1728126868342145481

link

zaptrem 933 days ago

As someone with a borderline acceptable understanding of RL this is the most accurate take so far.

link

maaaaattttt 934 days ago

If you have the possibility I would be quite interested in a link to the video or alternatively the name of the researcher you mention.

link

andrew3726 933 days ago

It's Noam Brown, he worked at Meta AI before on Cicero and No-hands Poker before that.

link