Y
Hacker News
new
|
ask
|
show
|
jobs
by
rvnx
205 days ago
Seems difficult to believe, considering the number of people who prepare this dataset, who also work(ed) or hold shares in Google or OpenAI, etc.
1 comments
menaerus
204 days ago
So everybody is cheating in your mind? We can't trust anything? How about taking a more balanced take: there's certainly some progress, and while the benchmark results most likely don't represent the world reality, the progress is continuous.
link