Hacker News new | ask | show | jobs
by johnjudeh 41 days ago
Thanks for sharing! It’s way easier to build an agent that can complete a task than to make sure it works across all the cases you care about. Especially when the output quality is really subjective