| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pxeger1 925 days ago
	But AlphaGo etc don’t use any kind of language-based AI, so LLMs (which this thread was about) are no good.

1 comments

The next step seems to be applying past advances in reinforcement learning with modern transformer based models

Which multiple teams are working on - OpenAI (Q*), and Meta just released a reinforcement learning framework

Could you point me towards Meta's reinforcement learning framework? I'd like to see how it stacks up against the OpenAI gym.

Thank you!