by
pxeger1
925 days ago
But AlphaGo etc. don’t use any kind of language-based AI, so LLMs (which this thread was about) are no good.
thisismyswamp
925 days ago
The next step seems to be combining past advances in reinforcement learning with modern transformer-based models.
mattsan
925 days ago
Which multiple teams are working on: OpenAI (Q*), and Meta, which just released a reinforcement learning framework.
npsomaratna
925 days ago
Could you point me towards Meta's reinforcement learning framework? I'd like to see how it stacks up against OpenAI Gym.
mattsan
924 days ago
Sure thing:
https://pearlagent.github.io/
HN post here:
https://news.ycombinator.com/item?id=38564526
npsomaratna
924 days ago
Thank you!