Hacker News
by kokakiwi 68 days ago
I think the problem given to Codex for the benchmark is the one in the attached video, where two Codex instances run side by side, working on a "standard" dev thingy