Hacker News new | ask | show | jobs
by berkut 959 days ago
I haven't tried it yet, but people in the /r/chatgpt subreddit are claiming GPT-4-Turbo seems to have issues with understanding/remembering longer (say 100 lines) of code, whereas 3.5 and 4.0 seem to have handled things a bit better, implying that the context-window size isn't (currently) as large as claimed.

Anyone else seeing any evidence of this?

2 comments

Some specialized subreddits can be incredibly useful. /r/chatgpt due to its popularity is not one of those.

It's full of memes and people complaining its not as "good" as it was yesterday when it fails at completing their homework.

I would take anything said there with a big grain of salt, and stick to benchmarks.

The context window IS longer, but it's less powerful. Obviously, they can't afford to have full transformer context over the entire context. That would be an impossibly large amount of ram. They're using some combination of sliding window/cyclical/or some other adjusted attention mechanism likely with some degree of summarization in some manner.