Hacker News new | ask | show | jobs
by axpy906 59 days ago
Once the bench is public it’s out and probably in the training data. Better to have your own and test it on a new model.