by vanderZwan 1103 days ago
> so it is almost certainly getting the wrong answers

Not checking the correctness of the output sounds like a pretty bad oversight for a benchmark.