Hacker News
by
aixpert
129 days ago
The article basically claims that LLMs are bad at politics and poker, neither of which is true (at least if they receive some level of reinforcement learning after sweep training).
1 comment
conradkay
129 days ago
Top LLMs are still very bad at poker; see this breakdown of a recent Kaggle experiment: <https://www.youtube.com/watch?v=jyv1bv7JKIQ>
What do you mean by sweep training here?