by aurareturn
123 days ago
No, OP said he used the Max Opus 4.6. Anyway, I think one area where Codex and Claude Code fall short is that they do not test the changes they make by actually using the app. In this case, the LLM should ideally render the page in a real browser and click the buttons to verify. Best if the LLM tests before the changes and again after, so it can confirm the behavior is the same. Maybe it should take a screenshot before the change, take another after, and compare the two. I asked why Codex and Claude don't do this here: https://news.ycombinator.com/item?id=46792066
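
To make the before/after screenshot idea concrete, here is a rough sketch of just the comparison step. It assumes the two screenshots have already been captured by some browser automation tool and decoded into rows of RGB tuples; the names `diff_ratio` and `screenshots_match` and the `tolerance` and `max_ratio` parameters are made up for illustration and are not something Codex or Claude Code provide.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]   # (R, G, B), each 0-255
Image = List[List[Pixel]]      # rows of pixels (assumed already decoded)


def diff_ratio(before: Image, after: Image, tolerance: int = 0) -> float:
    """Return the fraction of pixels whose channels differ by more than `tolerance`."""
    # Different dimensions: treat the page as completely changed.
    if len(before) != len(after) or any(
        len(rb) != len(ra) for rb, ra in zip(before, after)
    ):
        return 1.0

    total = sum(len(row) for row in before)
    if total == 0:
        return 0.0

    changed = 0
    for row_before, row_after in zip(before, after):
        for pb, pa in zip(row_before, row_after):
            # A pixel counts as changed if any channel moved past the tolerance.
            if any(abs(cb - ca) > tolerance for cb, ca in zip(pb, pa)):
                changed += 1
    return changed / total


def screenshots_match(
    before: Image, after: Image, max_ratio: float = 0.0, tolerance: int = 0
) -> bool:
    """True if the share of changed pixels is within the allowed `max_ratio`."""
    return diff_ratio(before, after, tolerance) <= max_ratio
```

In practice an agent would probably want a small nonzero `tolerance` and `max_ratio`, since font antialiasing and animations can shift a few pixels even when nothing meaningful changed.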