Y
Hacker News
new
|
ask
|
show
|
jobs
by
mrbungie
492 days ago
Give a group of "average human" two years, give or take 6 months, and they will also saturate the benchmark and probably some humans would beat the SOTA LLM/RLM.
People tend to do so all the time, with games for example.
1 comments
aoeusnth1
492 days ago
Average humans cannot be copy-pasted.
link
daveguy
490 days ago
Average companies also don't pay humans to complete a benchmark consisting of a fixed set of problems.
link