by abmfy 234 days ago
I spent 2-3 hours setting up; most of that time went into writing the evaluator.

Actually, I think the evaluator will be the most important part of making the whole pipeline work.

1 comments

Yes, getting the right workloads and ensuring correctness are crucial parts of the process.