Hacker News
by
refulgentis
1030 days ago
Well, no, we have the HumanEval results for the June release.
somenameforme
1029 days ago
Which is both (1) a subjective selection to measure the effectiveness of various chatbots and (2) now subject to gaming by companies using opaque/closed/inaccessible/unverifiable systems, like OpenAI.