by seanmcdirmid
153 days ago
I’m pretty sure this is already part of the training loop, even if it isn’t coming from the internet. It is definitely used for fine-tuning and distillation. As for how LLM producers avoid model collapse, they curate and filter.