| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by idle_zealot 768 days ago
	> That is very different from these models that are just rewarded for mimicking regardless if it is right or wrong That's not a totally accurate characterization. The base models are just trained to predict plausible text, but then the models are fine-tuned on instruct or chat training data that encourages a certain "attitude" and correctness. It's far from perfect, but an attempt is certainly made to train them to be right.

1 comments

Jensson 768 days ago

They are trained to replicate text semantically and then given a lot of correct statements to replicate, that is very different from being trained to be correct. That makes them more useful and less incorrect, but they still don't have a concept of correctness trained into them.

link