Hacker News new | ask | show | jobs
by refulgentis 1030 days ago
Well, no, we have the HumanEval results for the June release.
1 comments

Which is both (1) a subjective selection to measure the effectiveness of various chatbots and (2) now subject to gaming from companies using opaque/closed/inaccessible/unverifiable systems, like OpenAI.