Y
Hacker News
new
|
ask
|
show
|
jobs
by
Eric_Xua
61 days ago
Love the idea of turning agent benchmarks into a real-time Bomberman match between LLMs — super fun way to surface speed vs reasoning tradeoffs.