| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by belter 593 days ago
	Is this paper wrong? - https://arxiv.org/abs/2311.09807

1 comments

simonw 592 days ago

It shows that if you deliberately train LLMs against their own output in a loop you get problems. That's not what synthetic data training does.

link

belter 592 days ago

I understand and appreciate your clarification. However would it not be the case some synthetic data strategies, if misapplied, can resemble the feedback loop scenario and thus risk model collapse?

link