by epolanski
307 days ago
I will also state another semi-obvious thing that people seem to consistently forget: models are non-deterministic. You are not going to get the same output from GPT5 or Sonnet every time, and this obviously compounds across many steps. E.g. give GPT5 the code for a feature (by pointing it at some files and tests) and tell it to review the code, find improvement opportunities and write them down: depending on the size of the code, etc., the answers will differ slightly from run to run. I often do this in Cursor by having multiple agents review a PR, and each of them:
- has to write down their pr-number-review-model.md (e.g. pr-15-review-sonnet4.md)
- has to review the review files written by the other agents

Then I review it all myself and try to decide what's valuable in there and what's not. And to my disappointment (towards myself):
- they often do point to valid flaws I wouldn't have thought of
- they miss the "end-to-end" or general view of how the code fits into a program/process/business

What I mean: sometimes the real feedback would be that we don't need the code at all. But you need to have these conversations with the AI earlier.
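
For anyone who wants to script the bookkeeping, here is a minimal sketch of the file layout described above. The naming follows the pr-number-review-model.md pattern from the comment; the function names, the `writes`/`reads` structure and the model labels are hypothetical, and nothing here calls Cursor or any model API.

```python
def review_filename(pr_number: int, model: str) -> str:
    """Build the review file name, e.g. pr-15-review-sonnet4.md."""
    return f"pr-{pr_number}-review-{model}.md"


def cross_review_plan(pr_number: int, models: list[str]) -> dict[str, dict]:
    """For each model, list the file it writes and the peer reviews it must read.

    Hypothetical helper: it only plans file names, it does not run any agent.
    """
    own = {m: review_filename(pr_number, m) for m in models}
    return {
        m: {
            "writes": own[m],
            # Every review file except the model's own.
            "reads": [own[other] for other in models if other != m],
        }
        for m in models
    }


# Model labels are illustrative; use whatever agents you actually run.
plan = cross_review_plan(15, ["sonnet4", "gpt5"])
for model, files in plan.items():
    print(model, "writes", files["writes"], "and reads", files["reads"])
```

Since the outputs are non-deterministic, the plan only fixes who writes and reads which file; the contents of each review will still vary between runs.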