by esafak 211 days ago
No, thanks. Just use evals with error bars. If you can't get error bars, run an A/B test alongside your evals to detect spurious results.
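
For what "evals with error bars" could look like in practice, here is a minimal sketch that puts a percentile-bootstrap confidence interval around a mean eval score. The function name `bootstrap_ci`, its parameters, and the 0/1 per-example scores are all assumptions made for illustration; nothing here comes from a specific eval framework.

```python
import random
import statistics


def bootstrap_ci(scores, n_resamples=2000, alpha=0.05, seed=0):
    """Return (mean, lower, upper) for a percentile-bootstrap interval.

    `scores` is a list of per-example eval scores (e.g. 1 = pass, 0 = fail).
    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)  # fixed seed so the interval is reproducible
    n = len(scores)
    # Resample the eval set with replacement and record each resample's mean.
    means = sorted(
        statistics.fmean(rng.choices(scores, k=n)) for _ in range(n_resamples)
    )
    # Take the alpha/2 and 1 - alpha/2 percentiles of the resampled means.
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return statistics.fmean(scores), lower, upper


if __name__ == "__main__":
    # Made-up results: 70 passes and 30 failures out of 100 eval cases.
    example_scores = [1] * 70 + [0] * 30
    mean, lower, upper = bootstrap_ci(example_scores)
    print(f"accuracy = {mean:.2f}, 95% CI = [{lower:.2f}, {upper:.2f}]")
```

If the intervals for two model variants overlap heavily, the difference between them may well be noise, which is where the A/B test comes in.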