| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by nullc 1207 days ago
	Unless your xml input was very small you probably exceeded it's input window. I suspect chatgpt does some summarization under the covers but otherwise it still only has a finite and fairly small lookback.

1 comments

jamiethompson 1207 days ago

I suspect you're right there. Yet it forged ahead confidently giving answers anyway!

link

nullc 1204 days ago

Right, but consider its 'evaluation' during training: During training it is constantly seeing stuff where the context is out of the window and the correct completion confidently answers, so the model is trained to do the same.

I think this is very tricky to solve conceptually (since the human authors don't have the same input event horizon problem), but it could be (and has been) papered over by making the context bigger.

link