


	by hausrat 66 days ago
	This has very little to do with someone making the LLM too human but rather a core limitation of the transformer architecture itself. Fundamentally, the model has no notion of what is normal and what is exceptional; its only window into reality is its training data and your added prompt. From the perspective of the model, your prompt and its token vectors are tiny compared to the semantic representations it has built up over the course of training on billions of data points. How should it decide whether your prompt is actually interesting novel exploration of an unknown concept or just complete bogus? It can't, and that is why it falls back on the output that is most likely (and therefore most likely average) with respect to its training data.
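
	To make the "falls back on the most likely output" point concrete, here is a minimal sketch using a toy bigram counter with greedy decoding. This is not a transformer and not how any real LLM is implemented; the corpus, `train_bigram`, and `greedy_continue` are all hypothetical names made up for illustration. It only shows the general behavior: a context seen in training gets its most frequent continuation, and an unseen context gets pulled back to whatever was most common overall.

```python
from collections import Counter, defaultdict

def train_bigram(corpus):
    """Count token frequencies and which token follows which."""
    follows = defaultdict(Counter)
    unigrams = Counter()
    for sentence in corpus:
        tokens = sentence.split()
        unigrams.update(tokens)
        for prev, nxt in zip(tokens, tokens[1:]):
            follows[prev][nxt] += 1
    return follows, unigrams

def greedy_continue(prompt, follows, unigrams, steps=3):
    """Append the single most likely next token, `steps` times."""
    tokens = prompt.split()
    for _ in range(steps):
        candidates = follows.get(tokens[-1])
        if candidates:
            # Context seen in training: pick its most frequent successor.
            nxt = candidates.most_common(1)[0][0]
        else:
            # Unseen context: fall back to the most common token overall.
            nxt = unigrams.most_common(1)[0][0]
        tokens.append(nxt)
    return " ".join(tokens)

# Hypothetical toy training data.
corpus = [
    "the cat sat on the mat",
    "the cat sat on the sofa",
    "the cat ate the fish",
]
follows, unigrams = train_bigram(corpus)

seen = greedy_continue("the", follows, unigrams)        # familiar prompt
unseen = greedy_continue("quantum", follows, unigrams)  # prompt never seen in training
print(seen)
print(unseen)
```

	In this toy setup the unseen prompt is steered straight back onto the most common path in the corpus, which is the "average" behavior described above, in miniature.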

4 comments

winddude 66 days ago

> This has very little to do with someone making the LLM too human but rather a core limitation of the transformer architecture itself.

It has almost everything to do with it. Models have been fine-tuned to generate outputs that humans prefer.


zzzeek 66 days ago

the best thing it could do, and once in a while it does, is say, "hey that's really not a great way to do this and I'm not sure I could really make that work"

I've had very long sessions with LLMs that obviously didn't know how to do something, where I keep trying to get it to stop going in circles, but these days I have become attuned to noticing the "it's going in circles" pattern quickly, which is basically how it communicates "sorry I don't really know how to do that".


anuramat 66 days ago

wdym by "prompt and vector is small"? small as in "fewer tokens"? that should be a positive thing for any kind of estimation

in any case, how is this specific to transformers?


chrisjj 66 days ago

> How should it decide whether your prompt is actually interesting novel exploration of an unknown concept or just complete bogus?

It shouldn't. It should just do what it is told.


philipwhiuk 66 days ago

Remember that all it's actually 'doing' is predicting more text.
