Hacker News
by bayindirh 958 days ago
> We know that transformers can generalize within the training set.

> Of course it can't generalize beyond training - why would it?

4 out of 5 people I've discussed this subject with didn't know this, and even believed that current LLMs are not bound by their training set. They claimed that LLMs could synthesize data beyond their training set, and that the resulting answers would never be wrong.

There's a widespread misunderstanding about how these things work, and LLM developers don't make the effort to correct it, since it helps raise the hype even further.

2 comments

LLMs can, of course, synthesize data which is not in their training set.

Of course they can't create new facts other than, in principle, ones that can be derived from the training data.

I suppose generalization outside the training set's structure may occur by chance, should the outside set share enough of the same 'structure'. Basically, if you can find magical maps to your training set, then perhaps generalization may occur.