| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pavel_lishin 832 days ago
	What happens if you train a model on nothing but AI-generated output, recursively? Does it eventually get inbred?

5 comments

visarga 832 days ago

Why would you limit a model to be like a brain in a vat? Instead let the model out so people use it, then use the chat logs to fine-tune. A chat room is a kind of environment, there is a human, maybe some tools. The LLM text will generate feedback and right there is a learning signal.

Even without a human, if a LLM has access to code execution it can practice solving coding tasks with runtime feedback. There are many ways a LLM could obtain useful learning signals. After all, we got all our knowledge from the environment as well, in the end there is no other source for knowledge and skills.

link

Der_Einzige 832 days ago

I want to observe that one of my favorite youtubers did exactly this with making the "uppest case" and "lowest case" letters.

https://www.youtube.com/watch?v=HLRdruqQfRk

I love this guy so much and wish he made far more videos.

link

astrange 832 days ago

Depends how good the AI output is, just like it depends how good the natural output is.

If most of it is bad but you can get a better AI to tag it as bad, then it's not necessarily a problem.

link

Kuinox 832 days ago

Without human input, yes.

link

gwern 832 days ago

Does AlphaZero get inbred?

link