Hacker News
by chaosadm 211 days ago
The game environment looks pretty neat. I'm not surprised to see LLMs struggling, but now that there is a benchmark to focus new techniques on, I'm excited to see how the new solutions trying to top the leaderboard will do.