Y
Hacker News
new
|
ask
|
show
|
jobs
Show HN: Agentic Arena – 52 tasks implemented by Opus 4.5, Gemini 3, and GPT-5.1
(
arena.logic.inc
)
1 points
by
sgk284
208 days ago
1 comments
lostmsu
208 days ago
How does one vote? The name of the model that made the game should be hidden.
Is there a leaderboard?
link
sgk284
208 days ago
We put this together mostly just to do side-by-side comparisons, though you make a good point. It'd be fun to blind-vote on your favorite impl.
link
Is there a leaderboard?