|
|
|
|
|
by theshrike79
318 days ago
|
|
This is because you haven't given it a tool to verify the task is done. TDD works pretty well, have it write even the most basic test (or go full artisanal and write it yourself) first and then ask it to implement the code. I have a standing order in my main CLAUDE.md to "always run `task build` before claiming a task is done". All my projects use Task[0] with pretty standard structure where build always runs lint + test before building the project. With a semi-robust test suite I can be pretty sure nothing major broke if `task build` completes without errors. [0] https://taskfile.dev |
|
Plus, this is all besides the point. Simon argued that the model hallucinates less, not a specific product.