| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mtrycz2 1182 days ago
	> you guessed it, the word was glyph My pet conspiracy theory is that is is wired to please the user, to get better coverage from the media and social media.

2 comments

tome 1182 days ago

I think there's something to this. I have a theory that LLMs are implicitly trained primarily to impress people, since that's what motivates those who work on them, excites the general public, and convinces conferences to publish papers.

link

evnc 1177 days ago

In a sense, this is exactly what RLHF is, right?

link

tome 1177 days ago

I'm thinking of something at a larger scale. In some sense models that "wow" society get more interest and funding.

link

codetrotter 1182 days ago

I don’t think so. In Wordle you have to guess the word in six attempts. It’s a fun game and often simple.

So it could be that ChatGPT picked up on a pattern in the training data where after a couple of guesses, a lot of the time people pick the right word.

So statistically it might go like. Guess a word. Probably not the right one. Guess a couple more and suddenly it’s statistically likely to be the correct word, and because of that the LLM ends up outputting the congrats and so on

link