Hacker News new | ask | show | jobs
by thewataccount 1180 days ago
> You're teaching your model to copy an other model's outputs.

Which itself was trained on human outputs to do the same thing.

Very soon it will be full Ouroboros as humans use the model's output to finetune themselves.