| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by jrmg 155 days ago

I wonder where LLMs got this style from, since, while you did see something like it, it wasn’t nearly as widespread before the rise of ChatGPT, so not the most statistically likely form for these to be generated in.

Was it trained into them in the supervised or reinforcement phases?

Has it emerged from prompts extorting them to be friendly, somehow sneaking in from text message training as a result?

Is it now in a self-reinforcing feedback loop as training data grows to include modern Readmes?

2 comments

sdwr 154 days ago

As far as I know, it was trained in because people preferred it in A/B testing

link

LtWorf 154 days ago

Private chats maybe?

link