Hacker News
by smy20011 96 days ago
It's interesting to see that eval sets are becoming more and more expensive. Previously we just needed to evaluate on one test set; now we need to create a lot of diffs and run a lot of tests.