Hacker News new | ask | show | jobs
by misnome 74 days ago
Or, ask it to make a plan, and it makes a good plan! It explicitly notes how validation is to take place on each stage!

And then does every stage without running any of the validation. It's your agent's plan, it should probably be generated in a way that your own agent can follow it.