Y
Hacker News
new
|
ask
|
show
|
jobs
by
tootyskooty
266 days ago
would be nice to finally see multi-turn coding benchmarks. everything we have so far is single-turn and that's clearly not a realistic scenario.