Hacker News new | ask | show | jobs
by pxeger1 925 days ago
But AlphaGo etc don’t use any kind of language-based AI, so LLMs (which this thread was about) are no good.
1 comments

The next step seems to be applying past advances in reinforcement learning with modern transformer based models
Which multiple teams are working on - OpenAI (Q*), and Meta just released a reinforcement learning framework
Could you point me towards Meta's reinforcement learning framework? I'd like to see how it stacks up against the OpenAI gym.
Thank you!