Y
Hacker News
new
|
ask
|
show
|
jobs
by
vanderZwan
1103 days ago
>
so it is almost certainly getting the wrong answers
Not checking the correctness of the output sounds like a pretty bad oversight for a benchmark