Hacker News new | ask | show | jobs
by greesil 36 days ago
How do you know this? I'm not trying to attack your statement, I am genuinely curious how anyone knows anything about model performance outside of benchmarks that are already in the training set.
1 comments

using them you kind of get a feeling for skill level and can extrapolate that better than juiced benchmarks.