Hacker News new | ask | show | jobs
by nullc 1207 days ago
Unless your xml input was very small you probably exceeded it's input window.

I suspect chatgpt does some summarization under the covers but otherwise it still only has a finite and fairly small lookback.

1 comments

I suspect you're right there. Yet it forged ahead confidently giving answers anyway!
Right, but consider its 'evaluation' during training: During training it is constantly seeing stuff where the context is out of the window and the correct completion confidently answers, so the model is trained to do the same.

I think this is very tricky to solve conceptually (since the human authors don't have the same input event horizon problem), but it could be (and has been) papered over by making the context bigger.