|
|
|
|
|
by moonchrome
1081 days ago
|
|
GPT 4 pre nerf was terrible at reviewing non-trivial or non textbook code. I've decided to test it for a few weeks by checking stuff I caught in review or as bugs, to see if it would spot it. It was like 0% on first try (would always talk about something irrelevant) and after leading it with follow up questions it would figure out the problem half of the time and half of the time I'd just give up leading it. These were tricky problems that were small scope - I've picked them so I could easily provide it to GPT for review. So I doubt larger context window will do much. |
|