by Closi 754 days ago
The original comment says nothing about benchmarking; they just say that an AI can’t one-shot their complex task?

When I read

"My favorite thing to ask the models designed for programming is ....... None of them ever get it right"

I read "benchmark".