Hacker News
by CGamesPlay 1178 days ago
As a baseline, don't! If it performs horribly on the test and it cheated, that's even worse than if it fails the test and didn't cheat. So the benchmark score gives you an upper bound on performance.
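
A rough sketch of the upper-bound argument, under a simplifying assumption of my own rather than anything measured: leaked test items are always answered correctly, and everything else is answered at the system's real accuracy. The function name and the numbers are made up for illustration.

```python
def observed_score(true_accuracy: float, leaked_fraction: float) -> float:
    """Expected benchmark score when part of the test set leaked into training.

    Hypothetical model: leaked items are always answered correctly, and the
    remaining items are answered at the system's true accuracy.
    """
    if not (0.0 <= true_accuracy <= 1.0 and 0.0 <= leaked_fraction <= 1.0):
        raise ValueError("both arguments must be in [0, 1]")
    return leaked_fraction + (1.0 - leaked_fraction) * true_accuracy


# Illustrative values only: the observed score never drops below the true accuracy.
for leaked in (0.0, 0.25, 0.5):
    score = observed_score(true_accuracy=0.4, leaked_fraction=leaked)
    print(f"leaked={leaked:.2f} observed={score:.2f}")
```

Under that assumption, cheating can only push the number up, so a bad score is bad either way and a good score is at most as good as it looks.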