| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by kayodelycaon 304 days ago
	I suspect Reddit is a major source of their training material. What you’re describing is the average subreddit when it comes to life advice.

4 comments

gooodvibes 304 days ago

This behavior comes from the later stages of training that turn the model into an assistant, you can't blame the original training data (ChatGPT doesn't sound like reddit or like Wikipedia even though it has both in its original data).

link

morpheos137 300 days ago

It is shocking to me that 99% of people on YC news don't understand that LLMs encode tokens not verbatim training data. This is why I don't understand the NYT lawsuit against openAI. I can't see ChatGPT reproducing any text verbatim. Rather it is fine grained encoding of style in a multitude of domains. Again LLMs do not contain training data, they are a lossy compression of what the training data looks like.

link

password321 304 days ago

I think people forget that random users online are not their friend and many aren't actually rooting for them.

link

ThunderSizzle 304 days ago

Exactly the problem. Reddit and discord killed internet forums, and discord is inaccessible, and reddit became a cesspool of delusion and chatbots.

link

kayodelycaon 304 days ago

Reddit was a cesspool before social media became big.

link

morpheuskafka 303 days ago

Most reddit comments are rather sarcastic though, certainly not sycophantically answering the OP like the way the GPT model has become over time.

link

rsynnott 303 days ago

Eh, some of the "AITA"-type subreddits do seem to have a culture of, ah, giving the asker _way_ too much benefit of the doubt.

link