Hacker News new | ask | show | jobs
by jpollock 302 days ago
I gave it explicit ordering, instructions on what tools to _not_ use, and before/after examples from the codebase. A full page of instructions.

After iterating on that for a while, I did a bunch manually (90) and then gave the LLM a list of pull requests as examples, and asked _it_ to write the prompt. It still failed.

Finally, I broke the problem up and started to ask it to generate tools to perform each step. It started to make progress - each execution gave me a new checkpoint so it wouldn't make new mistakes.

1 comments

Yep, I think you did everything that's reasonable. I'm surprised myself only because I've been able to have Cursor do similar things for my codebase with no issues. Granted it's a react codebase following fairly standard practices.